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FLEA HEAD, NERVE CORD, HIND GUT AND MALPIGHIAN TUBULE 
NUCLEIC ACID MOLECULES, PROTEINS AND USES THEREOF 



CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims priority to U.S. Provisional Patent Application Serial 
Wber 60/128,704, entitled "NOVEL FLEA HEAD, NERVE CORD, HINDGUT AND 
MALPIGHIAN TUBULeViUCLEIC ACID MOLECULES, PROTEINS AND USES 
THEREOF", filed April 9, 195(9. 

FIELD OF THE INVENTION 
The present invention relates to nucleic acid molecules isolated from the head and 
10 nerve cord of a flea, nucleic acid molecules isolated from the hindgut and Malpighian 
tubule of a flea, proteins encoded by such nucleic acid molecules, antibodies raised 
against such proteins, and inhibitors of such proteins. The present invention also includes 
therapeutic compositions comprising such nucleic acid molecules, proteins, antibodies, 
and/or other inhibitors, as well as uses thereof. 
1 5 BACKGROUND OF THE INVENTION 

Flea infestation of animals is a health and economic concern because fleas are 
known to cause and/or transmit a variety of diseases. Fleas directly cause a variety of 
diseases, including allergies, and also carry a variety of infectious agents including, but 
not limited to, endoparasites (e.g., nematodes, cestodes, trematodes and protozoa), 
. 20 bacteria and viruses. In particular, the bites of fleas are a problem for animals maintained 
as pets because the infestation becomes a source of annoyance not only for the pet but 
also for the pet owner who may find his or her home generally contaminated with insects. 
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As such, fleas are a problem not only when they are on an animal but also when they are 
in the general environment of the animal. 

Bites from fleas are a particular problem because they not only can lead to disease 
transmission but also can cause a hypersensitive response in animals which is manifested 
as disease. For example, bites from fleas can cause an allergic disease called flea allergic 
(or allergy) dermatitis (FAD). A hypersensitive response in animals typically results in 
localized tissue inflammation and damage, causing substantial discomfort to the animal. 

The medical importance of flea infestation has prompted the development of 
reagents capable of controlling flea infestation. Commonly encountered methods to 
control flea infestation are generally focused on use of insecticides. While some of these 
products are efficacious, most, at best, offer protection of a very limited duration. 
Furthermore, many of the methods are often not successful in reducing flea populations. 
In particular, insecticides have been used to prevent flea infestation of animals by adding 
such insecticides to shampoos, powders, collars, sprays, spot-on formulations foggers and 
liquid bath treatments (i.e., dips). Reduction of flea infestation on the pet has been 
unsuccessful for one or more of the following reasons: failure of owner compliance 
(frequent administration is required); behavioral or physiological intolerance of the pet to 
the pesticide product or means of administration; and the emergence of flea populations 
resistant to the prescribed dose of pesticide. 

Thus, there remains a need to develop a reagent and a method to protect animals 
from flea infestation. 
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SUMMARY OF THE INVENTION 
The present invention relates to a novel product and process for protection of 
animals from flea infestation. 

The present invention provides flea head and nerve cord (HNC) proteins and flea 
5 hindgut and Malpighian tubule (HMT) proteins; nucleic acid molecules encoding flea 
HNC proteins and flea HMT proteins; antibodies raised against such proteins (i.e., anti- 
flea HNC antibodies and anti-flea HMT antibodies respectively); mimetopes of such 
proteins or antibodies; and compounds that inhibit flea HNC or HMT activity (i.e, 
inhibitory compounds or inhibitors). 
10 The present invention also includes methods to obtain such proteins, mimetopes, 

nucleic acid molecules, antibodies and inhibitory compounds. The present invention also 
includes the use of proteins and antibodies to identify such inhibitory compounds as well 
as assay kits to identify such inhibitory compounds. Also included in the present 
invention are therapeutic compositions comprising proteins, mimetopes, nucleic acid 
1 5 molecules, antibodies and inhibitory compounds of the present invention including 
protective compounds derived from a protein of the present invention that inhibit the 
activity of HNC and/or HMT proteins; also included are uses of such therapeutic 
compounds to reduce flea infestation. 

One embodiment of the present invention is an isolated nucleic acid molecule that 
20 hybridizes with a nucleic acid sequence having SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID NO:19, SEQ 
ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
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NO:28, SEQ ID NO:30, SEQ ID N0:3 1 , SEQ ID NO:33, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO: 153, SEQ ID 
NO: 155, SEQIDNO:156, SEQ IDNO:158, SEQ ID NO: 159, SEQ ID NO: 161, SEQ ID 

5 NO: 162, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO:170, SEQIDNO:1859, SEQ IDNO:1860, SEQ IDNO:1861, SEQ ID NO: 1863, 
SEQ ID NO: 1 864, SEQ ID NO: 1 866, SEQ ID NO: 1 867, SEQ ID NO: 1 869, SEQ ID 
NO: 1 870, SEQ ID NO : 1 87 1 , SEQ ID NO: 1 872, SEQ ID NO: 1 874, SEQ ID NO: 1 875 , 
SEQ ID NO: 1876, SEQ ED NO: 1877, SEQ ID NO: 1878, SEQ ID NO: 1880, SEQ ID 

10 NO: 1881, SEQ ID NO: 1882, SEQ ID NO: 1884, SEQ ID NO: 1885, SEQ ID NO: 1886, 
SEQ ID NO: 1887, SEQ ED NO: 1889, SEQ ID NO: 1890, SEQ ID NO: 1891, SEQ ID 
NO:1892, SEQ ID NO:1893, SEQ ID NO:1894, SEQ ID NO:1895, SEQ ID NO:1896, 
SEQ ID NO: 1898, SEQ ID NO: 1899, SEQ ID NO: 1900, SEQ ID NO: 1901, SEQ ID 
NO: 1903, SEQ ID NO: 1904, SEQ ID NO: 1905, SEQ ID NO: 1906, SEQ ID NO: 1907, 

15 SEQ ED NO: 1908, SEQ ID NO: 1909, SEQ ID NO: 19 10, SEQ ID NO: 191 1, SEQ ED 
NO:1912, SEQ ID NO:1913, SEQ ID NO:1914, SEQ ID NO:1916, SEQ ED NO:1917, 
SEQIDNO:1918, SEQIDNO:1919, SEQ ED NO: 1921, SEQ ED NO: 1922, SEQ ID 
NO: 1923, SEQ ID NO: 1924, SEQ ED NO: 1926, SEQ ID NO: 1927, SEQ ID NO: 1928, 
SEQ ID NO:1929, and/or SEQ ID NO:193 1 under conditions that allow less than or equal 

20 to about 30% base pair mismatch. Another embodiment of the present invention is an 
isolated nucleic acid molecule that hybridizes with a nucleic acid molecule selected from 
the group consisting of a nucleic acid sequence of Table I, Table II, Table III and/or 
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Table IV, or a nucleic acid sequence complementary to a nucleic acid sequence of 
Table I, Table n, Table III and/or Table IV under conditions that allow less than or equal 
to about 30% base pair mismatch. 

Another embodiment of the present invention is an isolated nucleic acid molecule 

5 having nucleic acid sequence that is at least about 70% identical to SEQ ID NO: 1, SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, 
SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ 
ID NO: 19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31 s SEQ ID NO:33, SEQ ID 

10 NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO:153, SEQ IDNO:155, SEQ IDNO:156, SEQIDNO:158, SEQ ID NO: 159, SEQ ID 
NO:16l, SEQ ID NO:162, SEQ ID NO:164, SEQ ID NO: 165, SEQ ID NO:167, SEQ ID 
NO:168, SEQ ID NO:170, SEQ ID NO:1859, SEQ ID NO:1860, SEQ ID NO:1861, SEQ 

15 IDNO:1863, SEQ ID NO:1864, SEQ ID NO: 1866, SEQ ID NO:1867, SEQ ID N0.1869, 
SEQ ID NO: 1870, SEQ ID NO:1871, SEQ ID NO: 1872, SEQ ID NO: 1874, SEQ ID 
NO: 1 875, SEQ ID NO: 1 876, SEQ ID NO: 1 877, SEQ ID NO: 1 878, SEQ ID NO: 1 880, 
SEQ ID NO: 1881, SEQ ID NO: 1882, SEQ ID NO: 1884, SEQ ID NO: 1885, SEQ ID 
NO:1886, SEQ ID NO:1887, SEQ ID NO:1889, SEQ ID NO:1890, SEQ IDNO:1891, 

20 SEQ ID NO: 1892, SEQ ID NO: 1893, SEQ ID NO: 1894, SEQ ID NO: 1895, SEQ ID 
NO: 1896, SEQ ID NO: 1898, SEQ ID NO: 1899, SEQ ID NO: 1900, SEQ ID NO: 1901, 
SEQ ID NO: 1903, SEQ ID NO: 1904, SEQ ID NO: 1905, SEQ ID NO: 1906, SEQ ID 



-5- 



FC-6-C1 

NO:1907, SEQ ID NO:1908, SEQ ID NO:1909, SEQ ID NO:1910, SEQ ID N0:191 1, 
SEQIDNO:1912, SEQ ID NO:1913, SEQ IDNO:1914, SEQIDNO:1916, SEQ ID 
NO: 1917, SEQ ID NO: 19 18, SEQ ID NO: 1919, SEQ ID NO: 1921, SEQ ID NO: 1922, 
SEQ ID NO: 1923, SEQ ID NO: 1924, SEQ ID NO: 1926, SEQ ID NO: 1927, SEQ ID 
5 NO: 1928, SEQ ID NO: 1929, and/or SEQ ID NO: 193 1 and/or a nucleic acid sequence of 
Table I, Table II, Table HI and/or Table IV or complements thereof. 
□ The present invention also relates to recombinant molecules, recombinant viruses 

y3 and recombinant cells that include a nucleic acid molecule of the present invention. Also 

^ included are methods to produce such nucleic acid molecules, recombinant molecules, 

10 recombinant viruses and recombinant cells. 

Another embodiment of the present invention includes an isolated flea HMT 
ftp and/or HNC protein that is at least about 70% identical to an amino acid sequence 

O selected from the group consisting of SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO: 14, SEQ 

ID NO:20, SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ED NO:44, SEQ ID 
15 NO: 154, SEQ ID NO: 160, SEQ ID NO: 163, SEQ ID NO: 169, SEQ ED NO: 1862, SEQ 
ED NO:1868, SEQ ED NO:1873, SEQ ED NO: 1879, SEQ ED NO:1883, SEQ ED N0.1888,. 
SEQ ED NO:1897, SEQ ED NO: 1902, SEQEO NO:1915, SEQ ED NO: 1920, SEQ ED 
NO: 1925, and/or SEQ ED NO: 1930, and/or an amino acid sequence encoded by a nucleic 
acid sequence of Table I, Table II, Table EG and/or Table IV, and fragments thereof, 
20 wherein such fragments can elicit an immune response against respective flea proteins or 
have activity comparable to respective flea proteins. 
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Another embodiment of the present invention includes an isolated protein 
encoded by a nucleic acid molecule that hybridizes with the complement of a nucleic acid 
sequence having SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO: 10, SEQ ID 
NO:13, SEQ ID NO:16, SEQ ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID 
5 NO:28, SEQ ID NO:31, SEQ ID NO:34, SEQ ID NO:37, SEQ ID NO:40, SEQ ID 
NO:43, SEQ ID NO:46, SEQ ID NO: 153, SEQ ID NO: 156, SEQ ID NO: 159, SEQ ID 
5 NO: 162, SEQ ID NO: 165, SEQ ID NO: 168, SEQ ID NO: 1859, SEQ ID NO: 1861, SEQ 

J ID NO: 1864, SEQ ID NO: 1867, SEQ ID NO: 1 870, SEQ ID NO: 1872, SEQ ID NO: 1875, 

S SEQ ID NO: 1 877, SEQ ID NO: 1 878, SEQ ID NO: 1 88 1 , SEQ ID NO: 1 882, SEQ ID 

fi 10 NO:1885,SEQEDNO:1887,SEQIDNO:1890,SEQIDNO:1892,SEQE)NO:1894, 

^ SEQ ID NO: 1896, SEQ ID NO: 1899, SEQ ID NO: 1901, SEQ ID NO: 1904, SEQ ID 

Pf NO:1906,SEQIDNO:1908,SEQIDNO:1910,SEQIDNO:1912,SEQIDNO:1914, 
g SEQ ID NO:1917, SEQ ID NO:1919, SEQ ID NO:1922, SEQ ID NO:1924, SEQ ID 

NO: 1927, and/or SEQ ID NO: 1929 and/or a nucleic acid sequence of Table I, Table II, 
15 Table IH and/or Table IV, under conditions that allow less than or equal to about 30% 
base pair mismatch. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides for nucleic acid molecules isolated from the head 
and/or nerve cord of a flea, nucleic acid molecules isolated from the hindgut and/or 
20 Malpighian tubule of a flea, proteins encoded by such nucleic acid molecules, antibodies 
raised against such proteins, and inhibitors of such proteins. As used herein, nucleic acid 
molecules isolated from the head and/or nerve cord of a flea and proteins encoded by 



-7- 



FC-6-C1 

such nucleic acid molecules are also referred to as flea HNC, or HNC, nucleic acid 
molecules and proteins respectively; and nucleic molecules isolated from the hindgut 
and/or Malpighian tubules of a flea and proteins encoded by such nucleic acid molecules 
are referred to as flea HMT or HMT, nucleic acid molecules and proteins respectively. 
HNC nucleic acid molecules and HMT nucleic acid molecules of the present invention 
are nucleic acid molecules that are primarily expressed in flea HNC tissues and HMT 
tissues respectively, but which may be expressed in cells derived from flea tissues other 
than HNC and HMT. HNC and HMT nucleic acid molecules and proteins of the present 
invention can be isolated from a flea or prepared recombinantly or synthetically. HMT 
and HNC nucleic acid molecules of the present invention can be RNA or DNA; examples 
of nucleic acid molecules include, but are not limited to, complementary DNA (cDNA) 
molecules, genomic DNA molecules, synthetic DNA molecules, DNA molecules which 
are specific tags for messenger RNA derived from HMT and HNC tissues, and 
corresponding mRNA molecules. As used herein, the phrases "HMT and/or HNC 
protein" and "HMT and HNC protein" refer to a protein expressed by a flea HMT tissue, 
by a flea HNC tissue, or by both flea HMT and HNC tissues. As used herein, the phrases 
"HMT and/or HNC nucleic acid molecule" and "HMT and HNC nucleic acid molecule" 
refer to a nucleic acid molecule that can be isolated from a HMT cDNA library, from a 
HNC cDNA library, or from both libraries, or a gene corresponding thereto. 

The present invention provides for nucleic acid molecules containing partial or 
full-length coding regions that encode one or more of the following flea proteins: an 
allantoinase (ALN) protein, a chitin-binding protein (CBP) protein, a sodium/potassium 
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ATPase beta subunit (NKAB) protein, a ligand-gated chloride channel (LGIC) protein, an 
ANON/23DA (ANON) protein, a malvolio (MALV) protein, an odorant-binding protein- 
like (OS-D) protein, a N-methyl-D-aspartate receptor associated (NMD A) protein, a 
chemical sense related lipophilic ligand binding protein-like (CLBP) protein, a 
Sodium/Hydrogen Transporter-like (NAH) protein, a Chloride Intracellular Channel-like 
(CLIC) protein, aPeritrophin-like (PL2) protein, aPeritrophin-like (PL3) protein, 
aPeritrophin-like (PL4) protein, a synaptic vesicle 2B-like (SVP) protein, a voltage-gated 
Chloride-like (VGCC) protein, an anoxia upregulated protein-like (AUP) protein, and a 
neuroendocrine specific 7B2-like (7B2) protein. Such nucleic acid molecules are referred 
to as ALN nucleic acid molecules, CBP nucleic acid molecules, NKAB nucleic acid 
molecules, LGIC nucleic acid molecules, ANON nucleic acid molecules, MALV nucleic 
acid molecules, OS-D nucleic acid molecules, NMDA nucleic acid molecules, CLBP 
nucleic acid molecules, NAH nucleic acid molecules, CLIC nucleic acid molecules, PL2 
nucleic acid molecules, PL3 nucleic acid molecules, PL4 nucleic acid molecules, SVP 
nucleic acid molecules, VGCC nucleic acid molecules, AUP nucleic acid molecules, and 
7B2 nucleic acid molecules respectively and are described herein in detail below. 

Allantoinase is involved in the catalysis of the reaction converting allantoin to 
allantoic acid. This is a middle step in purine catabolism, which in insects results in the 
secretion of urea as the end product. The enzyme is located in the peroxisomes of the 
liver and kidney in amphibians. There is no known mammalian homologue to 
allantoinase, as mammals secrete uric acid, a precursor to allantoin. As such, flea 
allantoinase represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 
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The function of chitin binding protein is largely unknown. A chitinase-like 
protein of Bombyx mori (GenBank accession # 1841851) is reported to have weak 
similarity with the chitin-binding domain of insect chitinases; however, it has no 
significant similarity to the catalytic regions of known chitinases, and therefore is not 
expected to have chitinase activity. The chitinase-like protein of B. mori is also similar to 
the peritrophin family of proteins located in the peritrophic matrix of insects. These 
proteins contain putative chitin-binding domains but have no other apparent homology to 
any known proteins. Without being bound by theory, it is believed that these proteins 
bind chitin and are a structural component of the peritrophic matrix. As such, flea chitin 
binding protein represents a novel target for anti-flea vaccines and chemotherapeutic 
drugs. 

Na+/K+ATPase is involved in the hydrolysis of ATP to power the transport of 
Na-h out of and K+ into cells. It is responsible for establishing the Na+ gradient across 
plasma membranes, which is then used by cells for a number of functions including sugar 
and amino acid transport, diuresis and nerve cell signaling. The Na+/K+ ATPase pump is 
a trimer of a 100-kilodalton (kDa) alpha (a) subunit, a 40-kDa beta (p) subunit, and a 6- 
kDa gamma (y) subunit. Most insects express three isotypes of the p subunit, each being 
expressed in a tissue and cell-type dependent manner. The a subunit has 8 
transmembrane domains whereas the p and y subunits have just one. The a subunit 
mediates ATPase and ion transporting activities and together with the y subunit 
comprises the site for cardiac glycoside (ouabain) binding. The P subunit is required for 
detectable pump activity, and is thought to have roles in stability, localization, and 
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determining cation specificity. As such, a flea NKAB protein of the present invention 
represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 

Ligand-gated ion channel family proteins have been shown to transmit neural 
signals in response to binding neurotransmitters such as GABA, glycine, and glutamate. 
GAB A and glycine receptors transmit inhibitory signals whereas glutamate receptors 
transmit excitatory signals. This family of proteins is the target for many drugs affecting 
neural signaling, and also for several families of insecticides including cyclodienes, 
pyrethroids, and phenyl pyrazoles. Northern blot analysis indicates that the mRNA 
corresponding to a LGIC nucleic acid molecule of the present invention is only expressed 
in HMT tissue, which suggests a role in the regulation or mediation of diuresis. Without 
being bound by theory, assuming protein expression correlates with the mRNA 
expression, flea LGIC may represent the first of this family of receptors shown to be 
exclusively expressed in renal tissue. Sequence analysis shows that a flea LGIC protein 
is distinct from other subfamilies of ligand-gated ion channels, and thus may represent a 
new subfamily. As such, a flea LGIC protein of the present invention represents a novel 
target for anti-flea vaccines and chemotherapeutic drugs. 

The function of ANON/23DA protein largely unknown. The ANON/23DA gene 
is reported to be linked to the MAD gene in Drosophila, though it is not known if 
ANON/23DA and MAD are functionally related. ANON/23DA may also have functional 
similarity to human probable membrane receptor protein pHPSl-2, which is similar to 
rhodopsiiufceta-adrenergic receptor which plays an important role in kidney function. As 
such, a flea ANON/23DA protein of the present invention represents a novel target for 
anti-flea vaccines and chemotherapeutic drugs. 
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Drosophila malvolio shows high sequence homology to mammalian natural 
resistance associated proteins (NRAMPs) and to yeast Smfl, which are proteins that 
transport divalent cations, specifically Mn++, Zn++, and Fe+-k NRAMPs have also been 
shown be similar to ATPase transporters and use ATP as an energy source. There are two 

5 types of NRAMP proteins, NRAMP1 and NRAMP2. NRAMP1 is expressed exclusively 
on macrophages and is responsible for preventing intracellular replication of microbes. 
NRAMP2 is expressed in several cell and tissue types, including mouse intestinal 
epithelia. Flea malvolio proteins of the present invention appear to be most similar to 
NRAMP 1. As such, a flea malvolio protein of the present invention represents a novel 

10 target for anti-flea vaccines and chemotherapeutic drugs. 

The function of OS-D proteins is largely unknown. An OS-D nucleic acid 
molecule isolated from a Drosophila melanogaster antenna cDNA library encodes a 
protein that shares features common to vertebrate odorant-binding proteins, but has a 
primary structure unlike odorant-binding proteins. The encoded protein is also 

15 homologous to a family of soluble chemosensory proteins from the chemosensory organ 
of the desert locust, Schistocerca gregaria. As such, a flea OS-D protein of the present 
invention represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 

NMDA receptors are a subtype of glutamate-gated ion channels. All glutamate- 
gated ion channels transmit Na+ and K+ when stimulated, resulting in a depolarization of 

20 the membrane potential. NMDA receptors also transport Ca++ into cells upon 

stimulation, which distinguishes NMDA receptors from the other glutamate-gated ion 
channels. NMDA receptors play an important role in glutamate excitotoxicity, which has 
been linked to a number of neurodegenerative disorders such as focal cerebral ischemia 
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(stroke), Parkinson's disease, Huntington's chorea, Alzheimer's disease, schizophrenia 
and epilepsy. It is thought that the Ca++ influx in open NMDA channels is the mediator 
for these diseases, since the increase in intracellular Ca-H- concentration leads to the 
induction of metabolic changes in the cell, including the activation of Ca++ dependent 
5 proteases and production of free-oxygen radicals. As such, a flea NMDA protein of the 
present invention represents a novel target for anti-flea vaccines and chemotherapeutic 
drugs. 

CLBP proteins of the present invention appear to fall into the family of 
PBP/GOBP proteins (pheromone binding protein/general odorant binding protein) based 

10 on sequence homology with members of this family (30% identity with PBPRP-2, 
pheromone binding protein related protein #2 of Drosophila melanogaster, and 
approximately the same identity with CSRLLBP, chemical sense related lipophilic ligand 
binding protein of Phormia regina). Without being bound by theory, it is believed that 
these proteins are involved in the perception of odors or pheromones, such as the ability 

15 to sense the presence of a host or mate. As such, a flea CLBP protein of the present 
invention represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 

Peritrophins, including flea PL2, PL3 and PL4 proteins of the present invention, 
are a family of putative chitin-binding proteins that comprise a structural component of 
the peritrophic matrix, an acellular membrane composed of proteins and sugars, most 

20 commonly chitin which forms a barrier between the contents of an ingested meal and the 
gut epithelia. Peritrophin-like proteins have also been shown to be present in the trachea 
of Drosophila embryos, indicating that such proteins may have additional roles outside 
the midgut. The function of the peritrophin-like proteins in adult fleas is not clear, since 
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adult fleas do not produce a peritrophic matrix in the gut. Peritrophins have been 
investigated as targets for immunological control of hematophagous insects including the 
sheep blowfly, Lucilia cuprina. It has been shown in this insect that ingestion of 
antibodies against peritrophins inhibits the growth of larvae and can result in increased 

5 larval mortality. It has also been shown that the ingestion of antibodies against 

peritrophins reduces the permeability of the peritrophic matrix in L. cuprina larvae. This 
in turn may inhibit the movement of digested food across the peritrophic matrix to the gut 
epithelium, resulting in starvation. As such, a flea peritrophin of the present invention 
represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 

10 In general, voltage-gated chloride channels (VGCC) maintain resting epithelial 

and neural membrane potentials and prevent hyperexcitability (sustained contraction) in 
muscle cells. In Drosophila Malpighian tubules, the diuretic hormone leukokinin has 
been shown to stimulate voltage-gated chloride channels in the stellate cells by increasing 
intracellular calcium levels. The flea VGCC protein sequence of the present invention 

15 contains an EF-hand calcium binding motif, indicating potential regulation by calcium 
ions, and thus a possible link to leukokinins and diuresis. Chloride channels are critical 
for diuresis since chloride is the primary anion driving diuresis and is required to help 
neutralize the sodium and potassium cations that are secreted into the lumen in response 
to diuretic peptide. The mRNA for the VGCC of the present invention has been shown to 

20 be HMT-specific in adult fleas, indicating a potential role in diuresis. As such, a flea 
VGCC of the present invention represents a novel target for anti-flea vaccines and 
chemotherapeutic drugs. 
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The CLIC family of chloride channels are voltage-gated chloride channels that are 
expressed on a variety of vesicles and are thought to act in concert with the V-ATPase 
pump to regulate the pH of the vesicle interior. Members of the CLIC family have also 
been shown to be expressed on the plasma membrane, again, in association with the V- 
ATPase pump. In humans, a homologous protein has been shown to be expressed on the 
plasma membrane in epithelial tissues, suggesting a possible role in transepithelial 
chloride transport and in cows, an antibody against a homologous channel has been 
shown to inhibit all chloride conductance in kidney microsomes. If the CLIC gene 
product is indeed involved in transepithelial chloride transport in HMT tissues, it likely 
plays a critical role in mediating diuresis. As such, a flea CLIC of the present invention 
represents a novel target for anti-flea vaccines and chemotherapeutic drugs. 

The NAH exchanger uses the proton gradient in the lumen of the Malpighian 
tubule to power the transport of sodium ions across the apical membrane into the lumen. 
The transport of sodium ions across the Malpighian tubule epithelia is induced by diuretic 
peptide and is a critical step in the induction of diuresis. The Northern blot analysis 
described herein indicates that NAH mRNA is upregulated within 15 minutes of feeding 
in adults, which is consistent with a molecule having a role in diuresis. In many insects, 
sodium has been shown to be the principle ion driving diuresis. The NAH exchanger has 
been shown to be located on the apical membrane in the Malpighian tubules, but may 
also be located in the hindgut and rectum. If located in the hindgut and rectum, it could 
be accessible to antibody attack on either the basolateral or apical membranes. As such, a 
flea NAH of the present invention represents a novel target for anti-flea vaccines and 
chemotherapeutic drugs. 
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S VP proteins have structural and sequence conservation with a bacterial family of 
proton co-transporters, with the mammalian proton/glucose transporter, and with organic 
ion transporters. SVP has 12 putative transmembrane regions that arise from an internal 
duplication. In mammals, it is located on neural and endocrine vesicles and is thought to 

5 function in the uptake of neurotransmitters into vesicles utilizing the proton gradient. 
Neurotransmitters in turn regulate the activity the ion channels on these membranes. In 
the Malpighian tubules, the activity of the ion channels determines the rate of diuresis, or 
fluid secretion from the hemolymph into the lumen. Thus, inhibiting the transport of 
neurotransmitters in the HMT tissues may have significant effects on the functions of 

10 these tissues. As such, a flea SVP of the present invention represents a novel target for 
anti-flea vaccines and chemotherapeutic drugs. 

The function of flea AUP proteins is largely unknown. C. felis AUP shares some 
homology to Drosophila melanogaster anoxia-regulated gene product fau. The 
Drosophila melanogaster fau gene has no homology to previously described database 

15 entries, but localizes to laminal and cortical neurons of the Drosophila CNS by in situ 
hybridization, and plays and important role in response to 02 deprivation as measured by 
impaired recovery time of transgenic flies over-expressing fau to anoxia. As such, a flea 
AUP of the present invention represents a novel target for anti-flea vaccines and 
chemotherapeutic drugs. 

20 A flea 7B2 protein has some BLAST homology to the neuroendocrine protein 

7B2 from various organisms, including Drosophila, C. elegans, the pond snail Lymnaea 
stagnalis, and humans. 7B2 has been implicated in activation of prohormone convertase 
2 (PC2) an important neuroendocrine precursor processing endoprotease. Additionally, 
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7B2 was found to be critical in islet hormone processing in mice using null mutants 
which displayed hypoglycemia, hyperproinsulinemia and hypoglucagonemia. As such, a 
flea 7B2 of the present invention represents a novel target for anti-flea vaccines and 
chemotherapeutic drugs. 

5 Flea allantoinase nucleic acid molecules of known length isolated from C felis are 

denoted "nCfALN # ", for example nCfALN 2057 , wherein "#" refers to the number of 
nucleotides in that molecule, and allantoinase proteins of known length are denoted 
"PCfALN # " (for example PCfALN 384 ) wherein refers to the number of amino acid 
residues in that molecule. Similarly, C.felis CBP nucleic acid molecules and proteins of 

1 0 known length are denoted "nCfCBP/' and "PCfCBP # ", respectively; C felis NKAB 
nucleic acid molecules and proteins of known length are denoted "nCfNKAB/' and 
"PCfNKAB # ", respectively; C.felis LGIC nucleic acid molecules and proteins of known 
length are denoted "nCfLGIC # " and "PCfLGIC/', respectively; C felis ANON nucleic 
acid molecules and proteins of known length are denoted "nCfANON/' and 

1 5 "PCfANON # ", respectively; C. felis MALV nucleic acid molecules and proteins of known 
length are denoted "nCfMALV # " and "PCfMALV # " respectively; C felis OS-D nucleic 
acid molecules and proteins of known length are denoted "nCfOSD # " and "PCfOSD # 
respectively; C felis NMDA nucleic acid molecules and proteins of known length are 
denoted "nCfNMDA # " and "PCfNMDAa respectively; C felis CLBP nucleic acid 

20 molecules and proteins of known length are denoted "nCfCLBP # " and "PCfCLBP # 
respectively, C. felis NAH nucleic acid molecules and proteins of known length are 
denoted u nCf NAH/' and "PCf NAIL respectively, C felis CLIC nucleic acid molecules 
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and proteins of known length are denoted "nCfCLIQ" and "PCfCLIQ respectively, C. 
felis PL2 nucleic acid molecules and proteins of known length are denoted "nCfPL2 # " and 
"PCfPL2 # respectively, C. felis PL3 nucleic acid molecules and proteins of known length 
are denoted "nCfPL3 # " and "PCfPL3 # respectively, C felis PL4 nucleic acid molecules 

5 and proteins of known length are denoted "nCfPL4 # " and "PCfPL4 # respectively, C. felis 
SVP nucleic acid molecules and proteins of known length are denoted "nCfSVP # " and 
"PCfSVP # respectively, C. felis VGCC nucleic acid molecules and proteins of known 
length are denoted "nCfVGCQ" and "PCfVGCC # respectively, C felis AUP nucleic acid 
molecules and proteins of known length are denoted "nCAUP f # " and "PCfAUP # 

10 respectively, and C. felis 7B2 nucleic acid molecules and proteins of known length are 
denoted "nCf7B2 # " and "PCf7B2 # respectively. 

The present invention also provides for HMT and HNC DNA molecules that are 
specific tags for messenger RNA molecules derived from HMT and HNC tissues. Such 
DNA molecules can correspond to an entire or partial sequence of a messenger RNA, and 

15 therefore, a DNA molecule corresponding to such a messenger RNA molecule (i.e. a 
cDNA molecule), can encode a full-length or partial-length protein. A nucleic acid 
molecule encoding a partial-length protein can be used directly as a probe or indirectly to 
generate primers to identify and/or isolate a cDNA nucleic acid molecule encoding a 
corresponding, or structurally related, full-length protein. Such a partial cDNA nucleic 

20 acid molecule can also be used in a similar manner to identify a genomic nucleic acid 
molecule, such as a nucleic acid molecule that contains the complete gene including 
regulatory regions, exons and introns. Methods for using partial HMT and HNC cDNA 
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molecules and sequences to isolate full-length transcripts and corresponding cDNA 
molecules are described in the examples herein below. 

The proteins and nucleic acid molecules of the present invention can be obtained 
from their natural source, or can be produced using, for example, recombinant nucleic 
acid technology or chemical synthesis. Also included in the present invention is the use of 
these proteins and nucleic acid molecules as well as antibodies and inhibitory compounds 
thereto as therapeutic compositions to protect animals from flea infestation as well as in 
other applications, such as those disclosed below. 

Flea HMT and HNC proteins and nucleic acid molecules of the present invention 
have utility because they represent novel targets for anti-arthropod vaccines and 
chemotherapeutic drugs. The products and processes of the present invention are 
advantageous because they enable the inhibition of arthropod development, 
metamorphosis, feeding, digestion and/or reproduction processes that involve HMT 
and/or HNC proteins. 

The head and nerve cord of the flea, including antennae, brain, corpora cardiacum, 
corpora allata, and subesophageal and abdominal ganglion tissues are of interest as such 
tissues are highly enriched for transcripts that encode neuronal and endocrine targets, as 
well as targets involved in chemosensory and mechanosensory reception. By sequencing 
cDNA fragments from a library enriched in flea head and nerve cord nucleic acid 
sequences (referred to herein as HNC nucleic acid sequences), genes, and their respective 
full-length coding regions, integrally involved with flea neuronal and endocrine function 
are identified. Once identified, these genes can be further characterized and specific 
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interference strategies are designed. As such, flea HNC proteins and nucleic acid 
molecules of the present invention have utility because they represent novel targets for 
anti-arthropod vaccines and chemotherapeutic drugs. 

Blood-feeding insects such as fleas ingest large quantities of blood relative to their 
body weight and, as such, are adapted to reduce the volume of the ingested blood meal 
through the rapid elimination of water. In addition, the concentrations of sodium, 
potassium, and chloride ions in the blood meal are greater than in the hemolymph of 
fleas, necessitating the excretion of excessive amounts of these ions. The active transport 
of these ions from the hemolymph into the lumens of the Malpighian tubules and the 
hindgut drives the passive transport of water and other hemolymph contents into these 
organs as well. While passing through these organs, waste products from the hemolymph 
are excreted and needed nutrients, water, and salts are reabsorbed. As such, interfering 
with these essential processes is an important strategy for developing a product for 
controlling flea populations. By sequencing cDNA fragments from a library enriched in 
hindgut and Malpighian tubule nucleic acid sequences (referred to herein as HMT nucleic 
acid sequences), genes integrally involved with these processes, and their respective full- 
length coding regions, are identified. Once identified, these genes are further 
characterized and specific interference strategies can be designed. As such, flea HMT 
proteins and nucleic acid molecules of the present invention have utility because they 
represent novel targets for anti-arthropod vaccines and chemotherapeutic drugs. 

One embodiment of the present invention is an isolated protein that includes a flea 
HMT and/or HNC protein. It is to be noted that the term "a" or "an" entity refers to one 
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or more of that entity; for example, a protein, a nucleic acid molecule, an antibody and a 
therapeutic composition refers to "one or more" or "at least one" protein, nucleic acid 
molecule, antibody and therapeutic composition respectively. As such, the terms "a" (or 
"an"), "one or more" and "at least one" can be used interchangeably herein. It is also to be 
noted that the terms "comprising", "including", and "having" can be used interchangeably. 
According to the present invention, an isolated, or biologically pure, protein, is a protein 
that has been removed from its natural milieu. As such, "isolated" and "biologically 
pure" do not necessarily reflect the extent to which the protein has been purified. An 
isolated protein of the present invention can be obtained from its natural source, can be 
produced using recombinant DNA technology, or can be produced by chemical synthesis. 

As used herein, isolated flea HMT and/or HNC proteins of the present invention 
can be full-length proteins or any homologue of such proteins. An isolated protein of the 
present invention, including a homologue, can be identified in a straight-forward manner 
by the protein's ability to elicit an immune response against a flea HMT and/or HNC 
protein or by the protein's HMT and/or HNC activity. Examples of flea HMT and HNC 
homologue proteins include flea HMT and HNC proteins in which amino acids have 
been deleted (e.g., a truncated version of the protein, such as a peptide), inserted, 
inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, 
acetylation, myristoylation, prenylation, palmitoylation, amidation and/or addition of 
glycerophosphatidyl inositol) such that the homologue includes at least one epitope 
capable of eliciting an immune response against a flea HMT or HNC protein, and/or of 
binding to an antibody directed against a flea HMT or HNC protein. That is, when the 
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homologue is administered to an animal as an immunogen, using techniques known to 
those skilled in the art, the animal will produce an immune response against at least one 
epitope of a natural flea HMT or HNC protein. The ability of a protein to effect an 
immune response can be measured using techniques known to those skilled in the art. As 
used herein, the term "epitope" refers to the smallest portion of a protein or other antigen 
capable of selectively binding to the antigen binding site of an antibody or a T cell 
receptor. It is well accepted by those skilled in the art that the minimal size of a protein 
epitope is about four to six amino acids. As is appreciated by those skilled in the art, an 
epitope can include amino acids that naturally are contiguous to each other as well as 
amino acids that, due to the tertiary structure of the natural protein, are in sufficiently 
close proximity to form an epitope. According to the present invention, an epitope 
includes a portion of a protein comprising at least about 4 amino acids, at least about 5 
amino acids, at least about 6 amino acids, at least about 10 amino acids, at least about 15 
amino acids, at least about 20 amino acids, at least about 25 amino acids, at least about 30 
amino acids, at least about 35 amino acids, at least about 40 amino acids or at least about 
50 amino acids in length. 

In one embodiment of the present invention a flea homologue protein has HMT or 
HNC activity, i.e. the homologue exhibits an activity similar to its natural counterpart. 
Examples of such activities are disclosed herein; e.g., all. Methods to detect and measure 
such activities are known to those skilled in the art. Examples of such activities are 
disclosed herein; e.g. allantoinase, chitin-binding protein, sodium/potassium ATPase, 
ligand-gated chloride channel, ANON/23DA, malvolio, odorant binding protein-like 
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protein, N-methyl-D-aspartate receptor associated protein, chemical sense related 
lipophilic ligand binding protein, Sodium/Hydrogen Transporter-like protein, a Chloride 
Intracellular Channel-like protein, aPeritrophin-like protein, aPeritrophin-like protein, 
aPeritrophin-like protein, a synaptic vesicle 2B-like protein, a voltage-gated Chloride-like 
protein, an anoxia upregulated protein-like protein, and a neuroendocrine specific 7B2- 
like protein. 

Flea HMT and/or HNC homologue proteins can be the result of natural allelic 
variation or natural mutation. Flea HMT and/or HNC protein homologues of the present 
invention can also be produced using techniques known in the art including, but not 
limited to, direct modifications to the protein or modifications to the gene encoding the 
protein using, for example, classic or recombinant DNA techniques to effect random or 
targeted mutagenesis. 

Flea HMT and HNC proteins of the present invention are encoded by flea HMT 
and HNC nucleic acid molecules, respectively. As used herein, flea HMT and HNC 
nucleic acid molecules include nucleic acid sequences related to natural flea HMT and 
HNC genes, and, preferably, to Ctenocephalides felis HMT and HNC genes. As used 
herein, flea HMT and HNC genes include all regions such as regulatory regions that 
control production of flea HMT and HNC proteins encoded by such genes (such as, but 
not limited to, transcription, translation or post-translation control regions) as well as the 
coding region itself, and any introns or non-translated coding regions. As used herein, a 
nucleic acid molecule that "includes" or "comprises" a sequence may include that 
sequence in one contiguous array, or may include the sequence as fragmented exons such 
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as is often found for a flea gene. As used herein, the term "coding region" refers to a 
continuous linear array of nucleotides that translates into a protein. A full-length coding 
region is that coding region that is translated into a full-length, i.e., a complete protein as 
would be initially translated in its natural millieu, prior to any post-translational 
modifications. 

One embodiment of the present invention is a C.felis ALN gene that includes the 
nucleic acid sequence SEQ ID NO: 1 and/or SEQ ID NO:4, a C.felis CBP gene that 
includes the nucleic acid sequence SEQ ID NO:7 and/or SEQ ID NO: 10, a C.felis NKAB 
gene that includes the nucleic acid sequence SEQ ID NO: 13 and/or SEQ ID NO: 16, a C. 
felis LGIC gene that includes the nucleic acid sequence SEQ ID NO:19, SEQ ID NO:22, 
SEQ ID NO:1861, and/or SEQ ED NO:1864, a C.felis ANON gene that includes the 
nucleic acid sequence SEQ ID NO:25 and/or SEQ ID NO:28, a C. felis MALV gene that 
includes the nucleic acid sequence SEQ ID NO:3 1 and/or SEQ ID NO:34, a C.felis OS-D 
gene that includes the nucleic acid sequence SEQ ID NO:37 and/or SEQ ID NO:40, a C. 
felis NMD A gene that includes the nucleic acid sequence SEQ ID NO: 43 and/or SEQ ID 
NO:46, a C.felis CLBP gene that includes the nucleic acid sequence SEQ ID NO: 153, 
SEQ ID NO:156, SEQ ID NO:159, SEQ ID NO: 162, SEQ ID NO:165, and/or SEQ ID 
NO: 168, a C.felis NAH gene that includes the nucleic acid sequence SEQ ID NO: 1867 
and/or SEQ ID NO: 1 870, a C. felis CLIC gene that includes the nucleic acid sequence 
SEQ ID NO: 1872 and/or SEQ ID NO: 1875, a C.felis PL2 gene that includes the nucleic 
acid sequence SEQ ID NO: 1877, SEQ ID NO: 1878, SEQ ID NO: 1880, SEQ ID NO: 1882 
and/or SEQ ID NO: 1885, a C.felis PL3 gene that includes the nucleic acid sequence SEQ 
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ID NO: 1887 and/or SEQ ID NO:1890, a C.felis PL4 gene that includes the nucleic acid 
sequence SEQ ID NO:1896 and/or SEQ ID NO:1899, a C.felis SVP gene that includes 
the nucleic acid sequence SEQ ID NO:1901 and/or SEQ ID NO:1904, a C.felis VGCC 
gene that includes the nucleic acid sequence SEQ ID NO:1914 and/or SEQ ID NO:1917, 
5 a C.felis AUP gene that includes the nucleic acid sequence SEQ ID NO:1919 and/or SEQ 
ID NO:1922, a C.felis 7B2 gene that includes the nucleic acid sequence SEQ ID 
NO: 1924 and/or SEQ ID NO: 1927, a C.felis gene that includes a nucleic acid sequence 
of Table I, Table H, Table HI and/or Table IV; as well as the complements of any of these 
nucleic acid sequences. These nucleic acid sequences are further described herein. For 
10 example, nucleic acid sequence SEQ ID NO:l represents the deduced sequence of the 
coding strand of a C. felis cDNA denoted herein as C. felis ALN nucleic acid molecule 
nCfALN 2057 , the production of which is disclosed in the Examples. Nucleic acid 
molecule SEQ ID NO: 1 comprises an apparently full-length coding region. The 
complement of SEQ ID NO:l (represented herein by SEQ ID NO:3) refers to the nucleic 
acid sequence of the strand fully complementary to the strand having SEQ ID NO: 1, 
which can easily be determined by those skilled in the art. Likewise, a nucleic acid 
sequence complement of any nucleic acid sequence of the present invention refers to the 
nucleic acid sequence of the nucleic acid strand that is fully complementary to (i.e., can 
form a complete double helix with) the strand for which the sequence is cited. For 
example, the complements of SEQ ID NOs: 4, 7, 10, 13, 16, 19, 22, 25, 28, 3 1, 34, 37, 
40, 43, 46, 153, 156, 159, 162, 165, and 168 are SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 
30, 33, 36, 39, 42, 45, 48, 155, 158, 161, 164, 167, and 170, respectively. It should be 
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noted that since nucleic acid sequencing technology is not entirely error-free, SEQ ID 
NO: 1 (as well as other nucleic acid and protein sequences presented herein) represents an 
apparent nucleic acid sequence of the nucleic acid molecule encoding an ALN protein of 
the present invention. 

Translation of SEQ ID NO:l, the coding strand of nCfALN 2057 , as well as 
translation of SEQ ID NO:4, the coding strand of nCfALN U52 , which represents the 
coding region of SEQ ID NO:l, each yields a protein of about 384 amino acids, denoted 
herein as PCfALN 384 , the amino acid sequence of which is presented in SEQ ID NO:2, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO:4. 

Translation of SEQ ID NO:7, the coding strand of nCfCBP 1128 , as well as 
translation of SEQ ID NO: 10, the coding strand of nCfCBP 1128 , which represents the 
coding region of SEQ ID NO:7, each yields a protein of about 272 amino acids, denoted 
herein as PCfCBP 272 , the amino acid sequence of which is presented in SEQ ID NO:8, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO: 10. 

Translation of SEQ ID NO: 13, the coding strand of nCfNKAB 1714 , as well as 
translation of SEQ ID NO: 16, the coding strand of nCfNKAB 978 , which represents the 
coding region of SEQ ID NO: 13, each yields a protein of about 326 amino acids, denoted 
herein as PCfNKAB 326 , the amino acid sequence of which is presented in SEQ ID NO: 14, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO:16. 
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Translation of SEQ ID NO: 1 9, the coding strand of nCfLGIC 2240 , as well as 
translation of SEQ ID NO:22, the coding strand of nCfLGIC 1707 , which represents the 
coding region of SEQ ID NO:19, each yields a protein of about 569 amino acids, denoted 
herein as PCfLGIC 569 , the amino acid sequence of which is presented in SEQ ID NO:20, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO:22. 

Translation of SEQ ID NO:25, the coding strand of nCfANON 1429 , as well as 
translation of SEQ ID NO:28, the coding strand of nCfANON U94 , which represents the 
coding region of SEQ ID NO:25, each yields a protein of about 398 amino acids, denoted 
herein as PCfANON 398 , the amino acid sequence of which is presented in SEQ ID NO:26, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO:28. 

Translation of SEQIDNO:31, the coding strand of nCfMALV 765 , as well as 
translation of SEQ ID NO:34, the coding strand of nCfMALV 762 , which represents the 
coding region of SEQ ID NO:3 1 . each yields a protein of about 327 amino acids, denoted 
herein as PCfMALV 254 , the amino acid sequence of which is presented in SEQ ID NO:32, 
assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of SEQ ID 
NO:34. 

Translation of SEQ ID NO:37, the coding strand of nCfOSD^, as well as 
translation of SEQ ID NO:40, the coding strand of nCfOSD 405 , which represents the 
coding region of SEQ ID NO:37, each yields a protein of about 135 amino acids, denoted 
herein as PCfOSD I35 , the amino acid sequence of which is presented in SEQ ID NO:38, 
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turning a firs, in-frame codon extending from nuc.eotide 1 to nucleotide 3 of SEQ ID 
NO:40. 

Translation of SEQ ID NO:43, the coding strand ofNMDA, 227 , as weU as 
translation of SEQ ID NO:46, the coding str and of nCfNMDA,,,, which represents me 
coding region o, SEQ m NO:43, each yields a protein of about 246 amino acids, denoted 
herein as PCfNMDA^, the amino acid sequence of which is presented in SEQ ID 
NO:44, assuming a first in-frame codon extending from nucleotide . to nucleotide 3 of 
SEQ ID NO:46. 

Translation of SEQ ID NO: 153, the coding strand of nCfCLBPl A^, as well as 
translation of SEQ ID NO ,56, the coding strand of „CfCLBPlA„„ which represents the 
coding region of SEQIDNO:,53, each yields a protein of about 147 amino acids, 
denoted herein as PCfCLBP,.,, the amino acid sequence of which is presented in SEQ ID 
NO:154, assuming a firs, in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 156. 

Translation of SEQ ID NO:162, the coding strand of nCfCLBI^,, as well as 
translation of SEQ ID NO: 165, the coding strand of nCfCLBPa^,, which represents the 
coding region of SEQ ID NO:.62, each yie.d S a protein of about .47 amino acids, 
denoted herein as PCfCLBP2A 14I , One amino acid sequence of which is presented in SEQ . 
ID NO: 163, assuming a firs, in-frame codon extending from nucleotide 1 ,o nucleotide 3 
ofSEQIDNO:165. 

Translation of SEQIDNO:1861, ,he coding strand of n C fLG,C„„ as we., as 
—on ofSEQIDNO:1864, the coding strand of nOLGIC,.,., which represents the 



-28- 



FC-6-C1 

coding region of SEQ ID NO: 1861, each yields a protein of about 672 amino acids, 
denoted herein as PCfLGIC 672 , the amino acid sequence of which is presented in SEQ ID 
NO:1862, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1864. 

Translation of SEQ ID NO:1867, the coding strand of nCfNAH^, as well as 
translation of SEQ ID NO.1870, the coding strand of nCfNAH, M4 , which represents the 
coding region of SEQ ID NO:1867, each yields a protein of about 608 amino acids, 
denoted herein as PCfNAH 608 , the amino acid sequence of which is presented in SEQ ID 
NO: 1868, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1870. 

Translation of SEQ ID NO:1872, the coding strand of nCfCLIC 2283 , as well as 
translation of SEQ ID NO: 1875, the coding strand of nCfCLIC 786 , which represents the 
coding region of SEQ ID NO: 1872, each yields a protein of about 262 amino acids, 
denoted herein as PCfCLIC 262 , the amino acid sequence of which is presented in SEQ ID 
NO: 1873, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1875. 

Translation of SEQ ID NO:1882, the coding strand of nCfPL2 1477 , as well as 
translation of SEQ ID NO: 1885, the coding strand of nCfPL2 1339 , which represents the 
coding region of SEQ ID NO: 1882, each yields a protein of about 453 amino acids, 
denoted herein as PCfPL2 453 , the amino acid sequence of which is presented in SEQ ID 
NO: 1883, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1885. 
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Translation of SEQ IE> NO: 1887, the coding strand of nCfPI^, as well as 
translation of SEQ ID NO: 1890, the coding strand of nC£PL3 243 , which represents the 
coding region of SEQ ID NO: 1887, each yields a protein of about 81 amino acids, 
denoted herein as PCfPL3 81 , the amino acid sequence of which is presented in SEQ ID 
5 NO: 1888, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1890. 

Translation of SEQ ID NO: 1896, the coding strand of nCfPL4 l062 , as well as 
translation of SEQ ID NO: 1899, the coding strand of nCfPL4 R55 , which represents the 
coding region of SEQ ID NO: 1896, each yields a protein of about 285 amino acids, 
10 denoted herein as PCfPL4 285 , the amino acid sequence of which is presented in SEQ ID 
NO: 1897, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1899. 

Translation of SEQ ID NO: 1901, the coding strand of nCfSVP lfi75 , as well as 
translation of SEQ ID NO: 1904, the coding strand of nCfSVP 1590 , which represents the 
15 coding region of SEQ ID NO: 1901 , each yields a protein of about 530 amino acids, 

denoted herein as PCfSVP 530 , the amino acid sequence of which is presented in SEQ ID 
NO: 1902, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1904. 

Translation of SEQ ID NO:1914, the coding strand of nCfVGCC 3l26 , as well as 
20 translation of SEQ ID NO: 1917, the coding strand of nCfVGCC^, which represents the 
coding region of SEQ ID NO: 1914, each yields a protein of about 85 1 amino acids, 
denoted herein as PCfVGCCg 51 , the amino acid sequence of which is presented in SEQ ID 
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NO: 1915, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQIDNO:1917. 

Translation of SEQ ID NO: 19 19, the coding strand of nCfAUP 118l , as well as 
translation of SEQ ID NO: 1922, the coding strand of nCfAUP 306 , which represents the 
coding region of SEQ ID NO: 1919, each yields a protein of about 102 amino acids, 
denoted herein as PCfAUP 102 , the amino acid sequence of which is presented in SEQ ID 
NO: 1920, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1922. 

Translation of SEQ ID NO: 1924, the coding strand of nCf7B2 2I61 , as well as 
translation of SEQ ID NO: 1927, the coding strand of nCf7B2 801 , which represents the 
coding region of SEQ ID NO: 1924, each yields a protein of about 267 amino acids, 
denoted herein as PCf7B2 267 , the amino acid sequence of which is presented in SEQ ID 
NO: 1925, assuming a first in-frame codon extending from nucleotide 1 to nucleotide 3 of 
SEQ ID NO: 1927. 

Table I represents a variety of flea HNC nucleic acid molecules of the present 
invention. Also cited in Table I are nucleic acid molecules from other organisms which 
share the closest sequence identity with the cited HNC sequences of the present 
invention, as determined by submitting each HNC sequence for a search through the 
National Center for Biotechnology Information (NCBI), National Library of Medicine, 
National Institute of Health, Baltimore, MD, using the BLAST network. This database 
includes SwissProt + PIR + SPupdate + GenPept + GPUpdate + PDB databases. The 
search was conducted using the xBLAST function using default parameters. 
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TABLE I 



;(SEQllD:;f\KD: 


SName < 






63 


2096-46 / 


MPase 6 j 


D. melanogaster 


64 


2098-25 i 


MP synthase delta chain 


Sus scrofa 


65 


2098-34 


F1 -ATPase epsilon-subunit 


ipomoea batatas 


66 


2110-19 i 


MP synthase beta subunit 


Drosophila pseudoobscura 


67 


2113-15 i 


MP synthase delta chain, 


Sus scrofa 


68 


2180-31 , 


MP synthase alpha subunit precursor 


Rattus rattus 


69 


2224-50 


aligomysin sensitivity conferring protein 


D. melanogaster 


70 


2116-51 


cysteine dioxygenase 


Homo sapiens 


71 


2116-55 


pyrroline-5-carboxylate 
dehydrogenase (P5CDh) 


Homo sapiens 


72 


2124-17 


AMP deaminase 


Homo sapiens 


73 


2138-38 


ubiquitin 


Mas musculus 


74 


2184-59 


manganese superoxide dismutase 


Homo sapiens 


75 


2096-24 


muscle LIM protein 1 


D. melanogaster 


76 


2140-53 


F25H5.1a 


C. elegans 


77 


2176-41 


Frazzled 


D. melanogaster 


78 


2223-11 


JMm domain-containing protein 


C. elegans 


79 


2223-53 


deleted in split hand/split foot 1 (DSS1) 


Homo sapiens 


80 


2225-28 


stranded-at-second 


D. melanogaster 


81 


2099-61 


histone H3 


Spisula solidissima 


82 


2114-21 


STE12 


S. cereviciae 


83 


2117-4 


Rad51 homolog 


Bombyx mod 


84 


2138-46 


heat shock protein p27 


D. immitis 


85 


2182-37 


heat shock protein 70 


D. immitis 


86 


2211-32 


BTB-ll protein domain gene 


D. melanogaster 


87 


2223-7 


heat shock protein 


D, melanogaster 


88 


2224-17 


heat shock protein 86 


Homo sapiens 


89 


2225-16 


POU domain protein 


D. melanogaster 


90 


2225-18 


nucleolin 


Xenopus laevis 


91 


2212-85 


thyroid hormone receptor-associated 
protein complex component TRAP220 


Homo sapiens 


92 


2211-21 


T03D8.3 


C. elegans 


93 


2223-67 


hepatoma derived growth factor (HDGF) 


Mus musculus 


94 


2225-61 


tyrosine hydroxylase type 1 (neuronal 
form) 


O. melanogaster 


95 


2097-7 


sarco/endoplasmic reticulum-type 

Ua-tt-M i rase 


D. melanogaster 






r'alr'ii im-trancnnrtinn ATPnco 
L/CtlOIUI 1 1 11 dl lo|JUl Ul iy r\ \ rdoc 


u. 1 1 icidi luyctoici 


97 


2099-19 


calcium channel alpha-1 subunit 


Aplysia californica 


GO 

ac 




r-iype voiiage-gaieo caicium cnannei 
alpha 1 subunit homolog 


nomo sapiens 


9S 


2124-2 


sarco/endoplasmic reticulum 
Ca2+-ATPase (SERCA) 


Procambarus clarkii 


10C 


I 2182-43 


sulfonylurea receptor 2b 


Mus musculus 


101 


2223-18 


Sodium-Potassium-Chloride cotransporter 


D. melanogaster 


102 


! 2223-63 


sarco/endoplasmic reticulum-type 
Ca2(+)-ATPase 


D. melanogaster 
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BelbifMSmol^^ -flS< 




103 


2224-1 3 J 


similar to ABC transporters ( 


0. elegans 


104 


2098-3 ( 


Damguk 


0. melanogaster 


105 


2101-9 I 


JNC-89 i 


0. elegans 


106 


2132-31 ; 


arginine kinase 


Homarus gammarus 


107 


2141-51 ( 


;asein kinase-ll beta 


Oryctolagus cuniculus 


108 


2178-18 < 


diacylglycerol kinase eta 


Cricetinae 


109 


2180-32 


'etinoid- and fatty acid-binding 
glycoprotein 


0. m elanogaser 


110 


2137-23 


vitellogenin 


Aedes aegypti 


111 


2144-14 


nuclear localization signal spot 1 


Mus musculus 


112 


2212-13 


putative n- terminal acetyltransf erase 


S. cereviciae 


113 


2212-27 


clathrin associated protein AP47 


Drosophila grimshawi 


114 


2223-28 


01 chloroquine-resistance protein 


Plasmodium falciparans 


115 


2224-14 


vitellogenin 


Athalia rosae 


116 


2224-15 


antigen NY-CO-3 


Homo sapiens 


117 


2225-24 


carbonic anhydrase 


C. elegans 


118 


2225-58 


yk500f6.3 


C. elegans 


119 


2225-76 


unknown 


Homo sapiens 


120 


2224-86 


BmP109 (cerebroside sulfate activator 
protein family) 


Bombyx mori 


121 


2225-23 


intersectin 


Homo sapiens 


122 


2170-16 


chemical-sense-related 
liDODhilic-liaand-bindino Drotein 


Phormia regina 


123 


2176-2 


olfactory receptor protein 2.4 


Danio rerio 


124 


2212-63 


olfactory receptor 


Xenopus laevis 


125 


2224-77 


inner mitochondrial membrane 
translocase Tim23 


Homo sapiens 


126 


2225-12 


sodium-dependent multi-vitamin 
transporter 


Rattus non/egicus 


127 


2225-42 


ribophorin 1 


Rattus non/egicus 


128 


2101-59 


phosphate carrier protein 


C. elegans 


129 


2132-38 


proteinase inhibitor 


Locusta migratoria 


130 


2174-72 


HE4 protein 


Homo sapiens 


131 


2211-48 


spermatogenic cell/sperm-associated 
Tat-binding homologue 


Rattus non/egicus 


132 


2110-23 


Gcapl gene product 


Mus musculus 


133 


2116-64 


toll protein 


D. melanogaster 


134 


• 2124-3 


tuberin (TSC2) gene 


Homo sapiens 


135 


. 2178-55 


RAS-like protein 


Gallus gallus 


136 


> 2223-35 


Rho1 gene product 


D. melanogaster 


137 


' 2224-82 


paxillin 


Homo sapiens 


13E 


i 2225-44 


adenylyl cyclase-associated protein (CAP] 


Homo sapiens 


13S 


) 2225-80 


adenylate kinase 


Gallus gallus 


14C 


) 2110-52 


hydroxyproline-rich glycoprotein 


Phaseolus vulgaris 


141 


2115-49 


mitogen inducible gene mig-2 


Homo sapiens 


142 


> 2116-5 


F52H3.5 


C. elegans 


14C 


3 2172-89 


12D3 antigen 


Babesia bovis 


14< 


!■ 2178-20 


frameshift 


P. falciparum 


14£ 


5 2178-81 


KIAA0066 


Homo sapeins 
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Table I (cont'd) 



SEQUDIiNO:- 




Sinbank^SmoIogyllSill 




146 


2182-16 x 


r57G11C4 ( 


3. elegans 


147 


2182-53 ( 


:16C10.5 < 


Z. elegans 


148 


2211-8 I 


Jnknown 1 


Homo sapiens 


149 


2211-31 


lopothetical. protein > 


^rabidopsis thaliana 


150 


2223-54 < 


DRF YNL207W 


5. cereviciae 


151 


2224-94 ' 


4.3 kDa perchloric acid soluble protein < 


Oapra hircus 


152 


2225-36 


"ST clone i 


0. elegans 


1 I 1 w 






f* O Clf*J id l <J 


i7pn 




*A nrntpin 
1 yji uicu 1 




1791 




ihini linnl-nx/tnnhrnmp n rprii mtflQP 


l^L// If^UOuwvl (CI/ l/r / /Jr L/ CO 
J\Ji 1 IMC 


1722 


2228-1 1 


similar to mitochondrial ATPase inhibitors 


C. elegans 


1 79*3 


999ft-1 ft 


uicuive enzyme 


^- #**n// 


1 794 


999ft-1 ft 


3iH^cr\mal nrntpin 1 7 A 
~IUUoLMilcil UIUlGlM Lf M 


JiUoUfJi ilia 


1 79^ 


999ft-99 


Trrmnnin-I \A/innc 1 in A 

rufjuniiri wiiiyo ujj r\ 


-// LfoUfJl Ilia 


1 79R 


C.C.C.O'C.0 


■ 0 none nrnrli ir>t 

Lb ycllc fJlUUULrl 


LfUII 


1 797 


C.C.C.OC I 


Vf^R^91 nflno nrnHurt 




1 79ft 


9009K-99K 


ni itativo tranennrt QX/Qtpm nprmPSiQfi 

JUlCtUVC 11 CLI loUUI l oyolClll tJCI 1 1 ICuoC 

nrotpin 






2228-32 


fiar>A nrntpin 


£ co// 


1 / OO 


9928-?4 

£.£.£.0 xJ^t 


^1 itsitivp nrntpin 


Ar&hidonsis thaliana 


17^1 

I f O I 


9998-^7 

^££0*0 / 


Ada 


£ co// 


1732 


2228-39 


Titin 


H. sapiens 


1 / OO 




auenyiosuccinaie synxneiase 


/tub fTlUSUUiUo 


1 7QA 


099ft./l'3 


iransTer niNA-Mia synxneiase 


^ mo/*/ 


1 7QC^ 
1 MO 


999ft- 44 


zinc Tinyer L/iMM-uinuing pruietn 


J 1 L/jUfJI Ilia 


1 /Oo 


999ft Aft 


lerTie m. TarnesyiiransTerase 


//. oafJlcl /o 


1 7Q7 
I f Of 


999ft-C\1 




^)rnQnnh itzt 

UI UOIJJJI lltCL 


17*3ft 


9998-*^ 


DDI inHDL-PHOSPHATF 
MANNOSYLTRANSFERASE 


£ co// 


17^Q 

I r Oi? 


999ft-Rft 


tmnnnin-T 

11 UpUl III 1 1 


Dmsonhila 

LJ 1 Vuwl/I Hid 


1 74n 


9998-^Q 


nrntpin HiQtilfiHp iQrtmpraQP 


^m^nnhilfi 

LSI \s<J \JyJI 1 lIC* 


1741 


9228-63 

£.£.£.0 Uu 


nrf hvnnthptical nrotpin 


E. coli 


174.9 


999P.fift 

C.CC.KJ "OO 


ilvl nnlv/nPntiHp 


E. coli 


1 74^ 




nrf hwnnthptirfll nrntpin 




1744 


9998-79 


^poniratnrv nitrstp rpHimtflQP 1 alnha 

chain 


£ co// 


174^ 


2228-77 


lomnloo of virulence factor 


£ co// 


1746 


2228-84 


ORF0164 


£ co// 


1 747 


999ft-Q1 


nimlpar nrntpin P^-^ nrfl 


Rattijs nnn/ecticus 

1 1 0 i i L/O 1 l\y 1 rvUlwUu 


174P 


994^-fifi 


Trnnnnin C\ 

l 1 KJyJKJl Mil v 


Drn^nnh Mr 

LSI V>w \SLJl ma 


174S 


2245-70 


Predicted secreted protein 


Plasmodium falciparum 


1 /Ot 


1 99AC: 79 


oytoenrorne 1 


W con/one 


I/O 




rpoD 


ita&i 1 lUUIUI 1 1 lalLrlJJaf UI 1 1 


1752 


t 2245-78 


sarco(endo)plasmic reticulum-type 
calcium ATPase 


Helliothis virescens 


175G 


i 2246-31 


Ras-related GTP-binding protein 


H. sapiens 


175^ 


\ 2246-57 


Similar to inositol 1,4,5-triphosphate 
receptor 


C. elegans 


175J 


> 2246-61 


reverse transcriptase-like protein 


Aedes aegypti 


175( 


3 2247-13 


polyprotein 


Drosophila 
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lliNarhia:';; ( 






1757 


2247-14 ( 


DRF2 for putative reverse transcriptase 


Drosophila 


1758 


2247-42 / 


\sparaginyl tRNA Synthetase 1 


H. sapiens 


1759 


2247-44 ( 


calcium binding protein i 


Drosophila 


1760 


2247-58 : 


similar to Fibronectin type III domain < 


0. elegans 


1761 


2247-62 I 


-everse transcriptase J 


Drosophila 


1762 


2247-65 ( 


gag-iike protein i 


Culex pipiens 


1763 


2247-79 


-3-DhosDhoserine Dhosohatase 


sapiens 


1764 


2247-80 


ssterase E4 


\/lvzus Dersicae 


1765 


2247-89 


Similar to aldehvde dehvdroaenase 


3 eleoans 


1766 


2248-76 


f)~44 d rote in 


Rattus so 


17R7 


2248-85 


^DWA i^nlfltpH for thi^ nrotpin iiQinn a 

^ i—/ 1 ^t/ \ louicucu iui lino i wig 1 1 1 uoii ly ci 

monoclonal antibody directed against the 
p27k prosomal protein 




1768 


2249-3 


Projectin 


Drosophila 


1769 


2249-5 


ORFJD:o312#14 


E. colt 


1770 


2249-9 


Heat shock protein 60 


Culicoides variipennis 


1771 


2249-11 


enigma protein 


H. sapiens 


1772 


2249-12 


alpha, alpha-trehalose glucohydrolase 


Oryctolagus cuniculus 


1773 


2249-13 


small GTP binding protein 


Drosophila 


1774 


2249-14 


Spermidine/putrescine transport system 
permease 


iE. coli 


1775 


2249-19 


nueroendocrine-specific protein C 


H. sapiens 


1776 


2249-21 


a-agglutinin core subunit 


Saccharomyces cerevisiae 


1777 


2249-24 


KIAA0337 


H. sapiens 


1778 


2249-34 


su(wa) protein 


Drosophila 


1779 


2249-42 


reaulator of kdD ooeron 


£. coli 


1780 


2249-59 


No definition line found 


C. elegans 


1781 


2249-60 


oroline oxidase 

■/I UNI IC UAI\JUv3C 


Drosoohila 


1782 


2249-62 


-ormate aeetvltransf erase 

1 will lulu Clvciyilicil IOICI www 


E. coli 


1783 


2249-70 


similar to HECT-domain 

Wll 1 111 fc* 1 IV ■ ■ V-/ 1 \A \J i 1 1 U III 


C. elegans 


1784 


2249-75 


PHOSPHORIBOSYLFORMYLGLYCINAM 
IDINE CYCLO-LIGASE 


E. coli 


1785 


2249-77 


-ivnothetical 38 5 kd orotein in aoal-mtr 

1 1 y UWll IwllvUI W W t W l\\J kslVSlwIII III UUUI lllll 

intergenic region precursor 


F. coli 


1786 


2249-85 


D4L 


Variola virus 


1787 


2249-87 


similar to isocitrate dehydrogenase 


C. elegans 


1788 


2250-6 


Fii (head-tail joining; 117) 


Bacteriophage Lambda 


1789 


2250-7 


possible NAGC-like transcriptional 
regulator 


E. coli 


179C 


2250-10 


cysteine string protein 


Bos taurus 


1791 


2250-13 


Tol B protein 


E. coli 


1792 


I 2250-14 


6-phosphogluconate dehydratase 


E. coli 


1792 


» 2250-15 


6-phosphogluconate dehydratase 


E. coli 


1794 


I 2250-22 


PSST subunit of the NADH: ubiquinone 
oxidoreductase 


Bos taurus 


179£ 


5 2250-30 


sol i 3 antigen 


Solenopsis invicta 


179C 


5 2250-36 


predicted using Genefinder; similar to 
tRNA synthetases class I (E and Q 


C. elegans 


179: 


? 2250-37 


PNP 


H. sapiens 


1795 


i 2250-42 


ORF_ID:o331#2 


E. coli 


179S 


3 2250-44 


Extensin 


E. coli 
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3 enBank^HdmoIoa^^^v^fe ^f€;:^Pf ?h§^K i 




1800 


2250-47 ( 


3RF 0654 J 


z. coli 


1801 


2250-48 ( 


3caDl aene Droduet j 


\Aus musculus 


1802 


2250-52 s 


similar to human MLH1 on chromosome i 
3p21 


Vlus musculus 


1803 


2250-53 


Hvoothetical 27.6 kd Drotein in hDt-DanD J 
ntergenic region. 


z. coli 


1804 


2250-58 


JmuC protein 


E. coli 


1805 


2250-61 


U134E15.1 (Blimp-1 


H. sapiens 


1806 


2250-63 


ribosomal protein L23-related product 
lomolog 


Rattus rattus 


1807 


2250-65 


lypothetical protein MJ1 143 


E coli 


1808 


2250-68 


HI0025 homolog 


E coli 


1809 


2250-77 


R34094_1 


H. sapiens 


1810 


2250-78 


erythrocyte binding protein 


Plasmodium yoelii 


1811 


2250-79 


osmidomycin resistance protein 


E coli 


1812 


2250-81 


cyciophilin 1 


Drosophila 


1813 


2250-83 


putative glutamine synthetase 


E coli 


1814 


2251 -3 


J (taikhost specificity; 11 32) 


Bacteriophage Lambda 


1815 


2251-5 


Molybdopterin biosynthesis MoeB protein 


E coli 


1816 


2251 -6 


"o-ATP synthase subunit b 


Drosophila 


1817 


2251-9 


citrate lyase alpha chain 


E coli 


1818 


2251-10 


cuticle protein ACP65A 


Drosophila 


1819 


2251-13 


H repeat-associated protein in rhsC 
3'reqion (orf-h3 


E. coli 


1820 


2251-20 


glycine-rich protein 


Arabadopsis thaliana 


1821 


2251 -23 


2-oxoglutarate dehydrogenase precursor 


H. sapiens 


1822 


2251-29 


NFX1 


H. sapiens 


1823 


2251-32 


ebgR product, repressor 


E coli 


1824 


2251-41 


neural protein 


Drosophila 


1825 


2251-45 


similar to unidentified ORF 


E. coli 


1826 


2251-46 


NADH:ubiquinone oxidoreductase b17.2 
subunit 


Bos taurus 


1827 


2251-49 


tyrosine kinase 


Drosophila 


1828 


2251-50 


coded for by C. elegans cDNA yk89e9.5 


C. elegans 


1829 


2251-57 


H (tail component;853) 


Bacteriophage Lambda 


1830 


2251-60 


Lysyl tRNA Synthetase 


Drosophila 


1831 


2251-62 


7,8-diamino-pelargonic acid 
aminotransferase 


E coli 


1832 


2251-64 


actin related protein 


Drosophila 


1833 


2252-6 


discs-large tumor suppressor 


Drosophila 


1834 


2252-16 


S-adenosylmethionine decarboxylase 


E coli 


183E 


2252-17 


F52H3.5 


E coli 


1836 


2252-21 


translationally controlled tumor protein 


Oryctolagus cuniculus 


1837 


2252-31 


GTP binding protein 


Rattus rattus 


183£ 


\ 2252-34 


mitochondrial porin transcript 1 


Drosophila 


183S 


) 2252-38 


cuticle protein 


Manduca sexta 


184C 


) 2252-39 


Similarity to Rat CD63 antigen 


C. elegans 


1841 


2252-41 


similar to S. cerevisiae Lpg20p 


E coli 


1845 


> 2252-48 


cutE 


E coli 


184C 


J 2252-61 


Histone H3 


Spisula solidissima 


184' 


2252-66 


ea10(ssb;122) 


Bacteriophage Lambda 


184J 


3 2252-71 


Mao C protein 


E coli 
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?SEQ'IE)::NCy: 








1846 


2252-72 


miniparomyosin 


Drosophila 


1847 


2252-73 


pherophorin-S 


Volvox carter! 


1848 


2252-80 


cyclophilin 


Mus musculus 


1849 


2252-84 


alternate gene name yhhG 


E. coli 


1850 


2222-20 


nucleoporin Nup98 


rat 


1851 


2222-21 


hypothetical protein 


Escherichia coli 


1852 


2222-36 


ribosomal protein S1 1 


human 


1853 


2222-39 


hypothetical protein PFB0315w 


Plasmodium falciparans 


1854 


2222-50 


serine/threonine-specific protein k. 


Plasmodium falciparans 


1855 


2222-58 


hypothetical protein C25E10.9 


C. elegans 


1856 


2222-64 


transporting^ ATP synthase 


bovine 


1857 


2222-94 


tricarboxylate carrier 


rat 


1858 2218-95 


anoxia upregulated protein 


Drosophila melanogaster 



Q 

M 
0j 
W 
01 

~ 

ru 

Q 
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Table II represents a variety of flea HMT nucleic acid molecules of the present 
invention. Also cited in Table II are nucleic acid molecules from other organisms which 
share the closest sequence identity with the cited HMT sequences of the present 
invention, as determined by a search through the BLAST network as described above. 



TABLE II 







Jel l DaMly;;i?J vjJilv.'V.y^ J 


fyrnati i^m^^s^MM^^jy" ii ii £ 


171 


2094-23 i 


nitochondrian ATP synthase alDha subunit 


Drosophila melanogaster 


172 


2104-20 i 


Tiitochondriai ATP synthase 


Drosophila melanogaster 


173 


2105-14 i 


ATP synthase aamma-subunit j 


Homo sapiens 


174 


2167-72 


slinomvsin sensitivity conferrina orotein 

Ik \-4 \J 1 I 1 y Oil 1 W 1 Iwl Kl * 1 L y V> x> III w 1 1 1 1 1 V4 1^ 1 v L ^> 1 1 1 


Drosophila melanogaster 


175 


2179-20 


ATPase 6 


Drosophila melanogaster 






r\ i r syninase suuunii d 


OL,ni£ctfJI IIS yfcli 1 III IUI 1 1 


H *7"7 
Iff 


OOOQ /M 


r\ i r syninase aipna suuunii 


U. Iflclctl luydotcf 


•4 7Q 


OOQ"! *3K 


y ku uasic proxein 


U. fllGlal lUyct&lGl 


1 7Q 

i /y 


tcOl -4/ 


hi r syninase aipria-suuuMii 


UUo LaUl uo 


i on 
1 ou 




rniiucnonunai mi r syniriaoe buuunu y 


tUIIIU OafJI&l to 


1 O 1 




_aie ernuryuyeneois auunucini proiein 


lUca yiGLULtCk 


i no 




or-ueia rnasKiny proicir i/siraiiucu ai seconu 


iJIUoUfJl ilia 1 1 icictl luyaotci 


1 oo 




Mryonauie proxein 


/ m \lai-/liJ\jfj&iO Li \CL\\CL\ ia 






j\e urusupniia nivir d iioimcoiiu 
DrobosciDedia orotein 


\*/. Gioycii to 


185 


2092-27 


DMDHEM2 


Drosophila melanogaster 


186 


2094-21 


SelD protein 


Drosophila melanogaster 


187 


2106-11 


Unr 


Rattus norvegicus 


188 


2231-15 


cno (canoe) 


D. melanogaster 


189 


2230-79 


ALR homologue 


D. melanogaster 


190 


2232-42 


saxophone serine-threonine kinase receptor 


D. melanogaster 


191 


2232-68 


selenophosphate synthetase 


D. melanogaster 


192 


2088-11 


MMTAX107, TAX responsive element 
binding protein 


Mus musculus 


193 


2089-2 


cs Dna J-1 


Cucumis sativus 


194 


2090-7 


Lethal (2) TID 


Drosophila melanogaster 


195 


2102-33 


monocytic leukaemia zinc finger protein 


homo sapiens 


196 


2105-26 


orf 1 5' of EpoR 


Mus musculus 


197 


2106-6 


contains similarity to EGF-1 


C. elegans 


198 


2106-9 


HSP70 protein 


Ceratitis capitata 


199 


2084-60 


82 kD heat shock protein 


Drosophila pseudobscura 


20C 


2108-59 


PAR domain protein 


Drosophila melanogaster 


201 


2156-34 


yk29g12.3 


C. elegans 


202 


2161-17 


segmentation protein 


Drosophila melanogaster 


202 


i 2162-28 


heat shock protein 70, hsp70A2 


Anopheles albimanus 


204 


\ 2187-18 


Heat shock protein 70 


Anopheles albimanus 


205 


i 2173-77 


Heat shock protein hsp70 


D. melanogaster 
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SEQllDiNQ;? 


Darnells ( 


SeoBafl^B^ ..-^ < 




206 


2165-30 r 


lucleolar protein i 


Drosophila melanogaster 


207 


2165-59 c 


contains similarity to C4-type zinc fingers 


0. elegans 


208 


2177-80 ; 


line finger protein i 


Mus musculus 


209 


2181-45 I 


3 AR domain protein 1 


Drosophila melanogster 


210 


2185-9 I 


Heat shock protein-70 > 


Anopheles albimanus 


211 


2185-82 ! 


segmentation protein 


Drosophila melanogaster 


212 


2188-33 1 


ranscriptional repressor protein 


Drosophila melanogaster 


213 


2203-18 


Mastermind 


Drosophila virilis 


214 


2205-82 


high mobility group protein 1a 


Chironomus tentans 


215 


2230-26 


DNA repair protein 


D. melanogaster 


216 


2230-71 


homologue of seven in absentia 


Homo sapiens 


217 


2230-89 


nuclear speckle-type protein, SPOP 


Homo sapiens 


218 


2230-96 


heat shock protein 


D. melanogaster 


219 


2231-7 


hypothetical protein 


S.pombe 


220 


2231-38 


Rad51 homolog 


Bombyx mori 


221 


2231-81 


DNA repair protein 


D. melanogaster 


222 


2232-2 


cellular nucleic acid binding protein 


Xenopus laevis 


223 


2234-63 


neat shock protein 70 


Triohoplusia ni 


224 


2232-77 


actin-binding double-zinc-finger protein 
(abLIM) 


Homo sapiens 


225 


2234-78 


DNA-binding protein isoform 1 


D. melanogaster 


226 


2084-48 


Allantoinase 


Rana catesbeiana 


227 


2085-22 


beta-glucuronidase . 


£ coli 


228 


2094-24 


prolidase = peptidaseD/imidopeptidase 


Mus musculus 


229 


2088-43 


branched chain alpha-keto acid 
dehydrogenase E1-beta subunit 


Bos taurus 


230 


2086-29 


3-hydroxyisobutyrate dehydrogenase 


Dictyostelium discoideum 


231 


2088-5 


Rab 5c protein 


Canis familiaris 


232 


2095-17 


cytochrome P-450 


Heliothis virescens 


233 


2102-16 


carbamoyl phosphate synthetase II 


Plasmodium falciparans 


234 


2102-48 


NADPH cytochrome P450 reductase 


Musca domestica 


235 


2104-15 


branched chain alpha-keto acid 
dehydrogenase 


Rattus norvegicus 


236 


2106-5 


Metallothionein 


Strongylocentrotus 
purpuratus 


237 


2106-47 


peroxidoxin-1 


Dirofilaria immitis 


238 


2107-17 


tetracycline transporter-like protein 


Mus musculus 


23S 


2107-58 


allergen Bla g 5 (glutathione-S-transferase) 


Blattella germanica 


24C 


2156-58 


HAL-3 homologue 


Arabidopsis thaliana 


241 


2195-90 


aminoacyclase-1 


Homo sapiens 


242 


! 2171-55 


NADPH— ferrihemoprotein reductase 


Drosophila melanogaster 


245 


\ 2169-30 


hypothetical protein 


Synechocystis sp 


244 


2169-52 


insulin degrading enzyme 


Drosophila melanogaster 


24E 


2177-64 


3-hydroxyisobutyrate dehydrogenase 


Rattus norvegicus 


246 


5 2181-69 


Endonexin 


Bos taurus 


247 


7 2138-25 


glutamate dehydrogenase 


Drosophila melanogaster 


24? 


) 2230-28 


glutathione -S-transferase 


Anopheles gambiae 


245 


} 2191-8 


lactase-phlorizin hydrolase 


Rattus rattus 


25C 


) 2193-52 


cytochrome P450 


Heloithis virescens 
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251 


2202-35 J 


jlutathione-S-transf erase > 


Anopheles gambiae 


252 


2229-77 c 


jlutathione-S-transferase > 


Anopheles gambiae 




OOOQ OH | 


Jialc UXIUabc J 


-/. 1 1 iclal iUyclotci 


OKA 


OOQ1.MO c 


superoxide uiornmdse 


f*ar\/i to o/anh/ /c 
L/C/ vUo vfafJf luo 




C.C.OC-- ( t < 


allergen Did y o j 




256 


2234-42 ( 


jlutathione reductase family 


Musca domestica 


257 


2087-8 ( 

< 


:ystic fibrosis transmembrane 
conductance regulator 


Homo sapiens 


258 


2087-23 1 


Slervous system antigen 2 


Drosophila melanogaster 


259 


2091-56 , 


adenosine triphosphatase 


Homo sapiens 


260 


2094-20 


sodium pump, alpha suhbunit 


Ctenocephalides felis 


261 


2095-51 


similar to Hrs 


C. elegans 


262 


2103-24 


N-methyl-D-aspartate receptor-associated 
protein 


Drosophila melanogaster 


263 


2105-55 


nward rectifying K channel 


Sus scrofa 


264 


2105-63 


EF-hand Ca2+ binding protein p22 


Rattus norvegicua 


265 


2106-62 


Dents disease candidate gene product 


Homo sapiens 


266 


2167-50 


PKD1 (polycystic kidney disease 1) 


Fugu rubripes 


267 


2185-37 


copper-transporting ATPase 


Archaeoglobus fulgidus 


268 


2193-29 


TrkG Potassium transport protein 


E. coli 


269 


2195-33 


silicon transporter 


Cylindrotheca fusiformis 


270 


2202-16 


similarity to human sulfate anion transporter 


C. elegans 


271 


2230-2 


sulfate transporter 


Arabidopsis thaliana 


272 


2230-69 


mitochondrial porin 


D. melanogaster 


273 


2231-22 


muscarinic acetylcholine receptor 


D. melanogaster 


274 


2231-24 


p97 subunit of 15S Mg(2+)- ATPase 


Xenopus laevis 


275 


2231-32 


anion transporting ATPase 


Aquifex aeolicus 


276 


2231-70 


sulfate permease 


Schizosaccharomyces 
pombe 


277 


2231-94 


putative Na/H exchanger 


S.pombe 


278 


2233-6 


plasma membrane Ca2+-ATPase 2 


Mus musculus 


279 


2233-24 


chloride channel gene, CLIC2 


Homo sapiens 


280 


2085-61 


beta-type protein kinase C 


Bos taurus 


281 


2089-20 


cGMP-dependent protein kinase 


Drosophila melanogaster 


282 


2092-12 


Btk 


Homo sapiens 


283 


2093-64 


Receptor-like protein tyrosine phosphatase 


Drosophila melanogaster 


284 


\ 2095-31 


frt (fms-related tyrosine kinase gene) 


Homo sapiens 


28E 


» 2094-58 


casein kinase II beta 


Oryctolagus cuniculus 


286 


i 2103-54 


ORF YGL084C 


Saccharomyces cerevisiae 


287 


' 2106-42 


protein phosphatase epsilon subunit 


Homo sapiens 


28£ 


i 2156-5 


serine/threonine kinase 


Rattus norvegicus 


28S 


) 2157-95 


cGMP-dependent protein kinase 


Drosophila melanogaster 


29( 


) 2165-80 


ABL gene product 


Gailus gallus 


291 


2165-63 


diadenosine tetraphosphatase 


Homo sapiens 


292 


» 2167-17 


adenylate cyclase 


S. cereviciae 


29C 


3 2177-44 


serine/threonine kinase 


C. elegans 


29< 


\ 2188-16 


weakly similar to serine/threonine kinase 


C. elegans 


295 


5 2191-60 


carbohydrate kinase, pfkB family 


Archaeoglobus fulgidus 
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296 


2195-22 


urotein kinase 


Drosophila melanogaster 


297 


2196-30 


calcium-dependent protein kinase / 


1 thaliana 


298 


2205-83 | 


Drotein kinase/endoribonulcease (IRE1) / 


Homo sapiens 


299 


2205-87 i 


'eceptor tyrosine phosphatase / 


Hirudo medicinalis 


300 


2229-11 I 
i 


nagnesium-dependent calcium 
nhibitable phosphatase 


Bos taurus 


301 


2229-29 | 


Dhosphoglycerate kinase 


Schistosoma mansoni 


302 


2229-74 


Dyruvate kinase i 


D. melanogaster 


303 


2230-55 


serine/threonine specific protein phosphatase J 


D. melanogaster 


304 


2230-57 


stress activated MAP kinase kinase 3 


D. melanogaster 


305 


2231-64 


alkaline phosphatase 


D. melanogaster 


306 


2231-91 


olynucleotide phosphorylase 


Yersinia enterocolitica 


307 


2232-43 


protein kinase PkwA 


Thermomonospora curvata 


308 


2234-94 


serine/threonine kinase ULK1 


Homo sapiens 


309 


2085-18 


3 yridoxamine phosphate oxidase 


C. elegans 


310 


2094-13 


sphingomyelin phosphodiesterase 


Mas musculus 


311 


2105-47 


apolipoprotein E receptor 2 


Homo sapiens 


312 


2092-38 


squalene synthetase 


Homo sapiens 


313 


2094-25 


: atty acid synthetase 


Rattus norvegicus 


314 


2089-32 


coproporphyrinogen oxidase 


Homo sapiens 


315 


2085-46 


HADHB mitochondrial Afunctional protein 
beta subunit 


Homo sapiens 


316 


2104-56 


pyridoxal kinase 


Homo sapiens 


317 


2107-30 


Phosphomevalonate kinase 


Homo sapiens 


318 


2154-70 


very-long chain acyl-CoA dehydrogenase 


Mus musculus 


319 


2191-85 


stearyl-CoA desaturase 


Cyprinus carpio 


320 


2192-44 


very-long-chain Acyl-CoA dehydrogenase 


Rattus norvegicus 


321 


2195-55 


Similar to LDL receDtor-related Drotein 


C. elegans 


322 


2229-82 


inase-3 


D. melanogaster 


323 


2231-59 


^hosDhatidvlethanolamine-bindino Drotein 

1 1 IVwU 1 IWiLlw V 1 w kl 1 1 1 1 Uil 1 1 1 1 1 v III 111 \U fcrf 1 Vp/ W VpS III 


Macaca fascicularis 


324 


2233-25 


similaritv to 

yeast ethanolaminephosphotransferase 


Q t elegans 


325 


2233-41 


cellular retinoic acid binding protein 
(mCRABP) 


Manduca sexta 


326 


2087-61 


1 allergen 


Lepidoglyphus destructor 


327 


2087-41 


chloroquine resistance candidate protein 


Plasmodium falciparum 


328 


\ 2089-51 


Xenopus Bf B 


Xenopus laevis 


32S 


I 2086-58 


repeat organellar protein 


Plasmodium falciparum 


33C 


) 2090-45 


heat shock cognate protein 


Drosophila melanogaster 


331 


2104-23 


40 kDa heat shock chaperone protein 


Deinococcus 


332 


> 2107-26 


Luciferase 


Photuris pennsylvanica 


332 


i 2162-46 


F20D1.9 


C. elegans 


33* 


\ 2162-49 


PKR inhibitor P58 


Bos taurus 


33£ 


5 2162-93 


GroES homologue 


Ricketsia 


336 


3 2171-46 


NH2 terminus uncertain 


Leishmania tarentolae 


33/ 


7 2089-10 


beta adaptin 


Drosophila melanogaster 


33? 


3 2229-24 


non-functional folate binding protein 


Homo sapiens 


33£ 


3 2229-25 


calmodulin B 


Halocynthia roretzi 
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340 


2229-31 


DUtative T1/ST2receptor binding protein < 


0. elegans 


341 


2229-36 < 


alpha-crystallin cognate protein 25 J 


Plodia interpunctella 


342 


2229-40 


Defensin / 


Apis mellifera 


343 


2229-86 ( 


glutamate — ammonia ligase j 


D. melanogaster 


344 


2231-49 


nelanoma-associated antigen ME491 J 


Homo sapiens 


345 


2231-76 


histone C 


Drosophila virilis 


346 


2232-65 


translationally controlled tumor protein 


Oryctolagus cuniculus 


347 


2232-84 


Apyrase 


Aedes aegypti 


348 


2232-85 


KIAA0124 


Homo sapiens 


349 


2233-59 


Glutamine-dependent 
carbamoyl-phosphate synthase 


C. elegans 


350 


2233-86 


ANG12 precursor 


Anopheles gambiae 


351 


2234-11 


tissue specific secretory protein 


Pan troglodytes 


352 


2234-76 


methionine adenosyltransferase 


D. melanogaster 


353 


2089-13 


Synaptic vessicle protein 2 form B 


Rattus norvegicus 


354 


2159-52 


glycoprotein 56 


Rattus norvegicus 


355 


2084-6 


CLN3; homologue of the gene underlying 
Batten disease 


Mus musculus 


356 


2085-10 


Amphiphysin 


Gallus gallus 


357 


2156-39 


glycoprotein 55 


Rattus norvegicus 


358 


2104-59 


Transmembrane transporter 


Discopyge ommata 


359 


2105-9 


insect intestinal mucin II 


Trichoplusia ni 


360 


2106-14 


kinesin-like protein 


D. melanogaster 


361 


2107-45 


Lazarillo precursor 


Schistocerca americana 


362 


2156-3 


clathrin-associated protein 


Mus musculus 


363 


2161-46 


neural variant mena+ protein 


Mus musculus 


364 


2171-92 


Malvolio 


Drosophila melanogaster 


365 


2175-18 


homolog of SYT - synaptotagmin 


Mus musculus 


366 


2177-10 


GABA receptor subunit (Rdl) 


Aedes aegypti 


367 


2181-10 


neurexin IV 


Drosophila melanogaster 


368 


2191-92 


synaptic vessicle protein 2B 


Rattus norvegicus 


369 


2229-18 


Synaptic vessicle protein 2A 


Rattus norvegicus 


370 


2194-38 


gamma-subunit of mouse nerve growth factor 


Mus musculus 


371 


2230-60 


lin-7-C 


Rattus norvegicus 


372 


2230-81 


PDZ domain protein 


Homo sapiens 


373 


2234-5 


Gcapl gene product 


Mus musculus 


374 


2234-55 


Gcapl gene product 


Mus musculus 


37E 


2234-71 


Gcapl gene product 


Mus musculus 


376 


i 2085-34 


Liver-specific transport protein 


Rattus norvegicus 


377 


' 2087-15 


polyspecific organic cation transporter 


Homo sapiens 


37E 


\ 2204-80 


transmembrane transporter 


Discopyge ommatta 


37S 


I 2093-39 


liver-specific transport protein 


Rattus norvegicus 


38C 


) 2093-46 


similar to monocarboxylate transporter family 


C. elegans 


381 


2092-22 


similar to matrin F/G 


C. elegans 


382 


> 2103-50 


Unknown 


Drosophila melanogaster 


38J 


I 2103-51 


organic cation transporter 


Ratttus norvegicus 


38' 


\ 2197-35 


renal organic cation transporter 


Oryctolagus cuniculus 


38J 


5 2156-17 


sulfate anion transporter 


Manduca sexta 
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386 


2166-84 


_X1 I 


Mus musculus 


387 


2167-94 


VICT (monocarboxylate transporter) J 


Homo sapiens 


388 


2196-83 l 


-enal organic cation transporter 


Oryctolagus cuniculus 


389 


2229-83 j 


similarity to monocarboxylate transporter 1 


C. elegans 


390 


2231-89 


3olgi 4-transmembrane spanning transporter 
MTP 


Mus musculus 


391 


2158-8 


phosphate carrier protein 


C. elegans 


392 


2085-14 


ADP/ATP translocase 


Drosophila melanogaster 


393 


2085-17 


Na+-dependent inorganic 
phosphatase cotransporter 


Drosophila melanogaster 


394 


2088-38 


ADP/ATP translocase 


Bos taurus 


395 


2092-50 


ADP/ATP translocase 


Drosophila melanogaster 


396 


2104-21 


Na(+)-dependent inorganic 
phosphate cotransporter 


Drosophila melanogaster 


397 


2121-55 


phosphate carrier protein 


C. elegans 


398 


2105-64 


phosphate carrier protein 


Homo sapiens 


399 


2102-6 


ZK512.6 


C. elegans 


400 


2108-27 


mitochondrial phosphate carrier protein 


Homo sapiens 


401 


2194-63 


mitochondrial phosphate transporter 


Rattus norvegicus 


402 


2196-14 


phosphate/triose-phosphate 
translocator precursor 


C. elegans 


403 


2204-11 


EST clone 


D. melanogaster 


404 


2085-16 


Chymotrypsin I 


Anopheles gambiae 


405 


2085-54 


Chymotrypsin II 


Anopheles gambiae 


406 


2086-12 


Plasminogen 


Homo sapiens 


407 


2086-18 


Trypsin eta 


Drosophila melanogaster 


408 


2090-21 


Trypsin 


Manduca sexta 


409 


2092-15 


Alp1 


Cochliobolus carbonum 


410 


2102-11 


vitellin-degrading protease 


Bombyx mori 


411 


2102-17 


Chymotrypsin II 


Anopheles gambiae 


412 


2102-51 


chymotrypsin -like protease 


Anopheles gambiae 


413 


2103-31 


Beta trypsin 


Drosophila erecta 


414 


2107-22 


Factor IX 


Rattus norvegicus 


415 


2108-29 


Trypsin 


Anopheles Stephens! 


416 


2157-15 


Trypsin 


Choristoneura fumiferana 


417 


2160-34 


Aminopeptidase 


Synechocystis 


418 


2160-36 


E01G6.1 


C. elegans 


419 


2103-62 


plasminogen activator inhibitor 2 


Mus musculus 


420 


2167-36 


factor IX 


Oryctolagus cuniculus 


421 


2167-67 


Alp1 


Cochliobolus carbonum 


422 


2169-51 


Trypsin 


Aedes aegypti 


423 


2181-27 


Chymotrypsin Bll 


Penaeus vannamei 


424 


2185-69 


plasma prekallikrein 


Homo sapeins 


425 


> 2187-20 


pre-procathepsin L 


Paragonimus westermani 


426 


i 2188-45 


vitellin-degrading protease 


Bombyx mori 


427 


r 2192-91 


late trypsin precourser 


Culex pipiens 
quinquefasciatus 


426 


I 2196-10 


SPC2 


Branchiostoma 
californiensis 


42S 


> 2196-88 


Trypsin 


Anopheles stephensi 
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430 


2204-9 ( 


:arnitine/choIine acetyltransferase 


C. elegans 


431 


2229-7 i 


ota trypsin 


D. melanogaster 


432 


2229-22 " 


Frypsin 


Anopheles gambiae 


433 


2229-89 " 


frypsin 


Anopheles gambiae 


434 


2229-94 


ate trypsin precourser 


Culex pipiens 
quinquefasciatus 


435 


2230-59 


Chymotrypsin 1 


Anopheles gambiae 


436 


2230-67 


sarboxypeptidase A 


Drosophila heteroneura 


437 


2231-62 


aminopeptidase N 


Sus scxrofa 


438 


2231-74 


imulus factor C serine protease 


Tachypleus tridentatus 


439 


2232-15 


cysteine proteinase 


Sitophilus zeamais 


440 


2232-25 


Carboxypeptidase 


Simulium vitatum 


441 


2232-33 


putative aspartic protease 


Brassica oleracea 


442 


2233-46 


aminopeptidase N 


Pleuronectes americanus 


443 


2233-85 


chymotrypsin 1 


Anopheles gambiae 


444 


2233-90 


""rypsin 


Anopheles Stephens'! 


445 


2233-94 


Dreprechymotrypsin 1 


Penaeus vannamei 


446 


2234-29 


chymotrypsin-like protease precursor 


Aedes aegypti 


447 


2234-58 


Mutative 


C. elegans 


448 


2234-61 


carboxylesterase precursor 


Aphis gossypii 


449 


2234-68 


serine protease inhibitor i 


Schistocerca gregaria 


450 


2084-35 


Integral membrane protein 


Mus musculus 


451 


2086-45 


similar to beta-ureidopropionase of Rat 


C. elegans 


452 


2087-54 


Cyclin 


Mus musculus 


453 


2088-22 


Esp 8 


Mus musculus 


454 


2091-16 


contains similarity to EGF-like domains 


C. elegans 


455 


2091-29 


multiple exostosis-like protein 


Homo sapiens 


456 


2091-30 


apoptosis 1 inhibitor 


Drosophila melanogaster 


457 


2092-33 


KIAA0023 (putitive oncogene) 


Homo sapiens 


458 


2095-35 


G coupled receptor 


C. elegans 


459 


2095-3 


Go (heterotrimeric guanyl nucleotide 
binding protein alpha subunit) 


Manduca sexta 


460 


2085-4 


gp 150 protein 


Drosophila melanogaster 


461 


2103-28 


leukotriene A4 hydrolase 


Rattus sp. 


462 


2105-62 


putitive orf 


Homo sapiens 


463 


2107-6 


activator protein 


Drosophila melanogaster 


464 


2107-28 


platelet-endothelial tetraspan antigen 3 


Homo sapiens 


465 


2189-3 


oligopeptidase A (prIC) 


Haemopholis influenzea 


466 


2156-54 


fibroblast growth factor receptor 


Xenopus laevis 


467 


2160-92 


contains similarity to EGF-like domains 


C. elegans 


468 


2160-65 


weak similarity to the drosophila hyperplastic 
disc protein 


C. elegans 


46S 


2165-53 


inositol triphosphate receptor 


Rattus norvegicus 


47C 


) 2166-22 


placental protein 1 1 


Homo sapiens 


471 


2166-92 


elongation factor 1 alpha-like 


Drosophila melanogaster 


472 


t 2181-34 


DSch 


Drosophila melanogaster 


475 


I 2192-65 


STAM, signal transducing adaptor molecule 


Homo sapiens 


474 


2194-24 


ATPases associated with various cellular 
activities (AAA family) 


Arabidopsis thaliana 
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475 


2196-75 ! 


similar to cell division control protein 


C. elegans 


476 


2230-38 


EST clone 


S. cereviciae 


477 


2230-39 


MTPase 


D. melanogaster 


478 


2230-66 , 


adenylyl cyclase aggregation protein 


Dictyostelium discoideum 


479 


2230-80 


sphingomyelin phosphodiesterase 


C. elegans 


480 


2231-29 


nuclear antigen H731 


Homo sapiens 


481 


2231 -40 


suppressor of actin mutation 2 


Homo sapiens 


482 


2231-66 


DET1 


Arabidopsis thaliana 


483 


2232-7 


Calreticulin 


D. melanogaster 


484 


2232-38 


activator protein 


D. melanogaster 


485 


2232-69 


ornithine decarboxylase 


Gallus gallus 


486 


2233-32 


similar bHLH-PAS 


D. melanogaster 


487 


2233-45 


rab1 


D. melanogaster 


488 


2234-2 


C10A gene product 


Mus musculus 


489 


2234-72 


QM homolog 


D. melanogaster 


490 


2084-17 


ntegral membrane protein 


Herpesvirus-2 


491 


2091-4 


endomembrane protien EMP70 precourser 
isolog 


Arabidopsis thaliana 


492 


2102-45 


Ylr251wp 


Saccharomyces cerevisiae 


493 


2162-68 


220 kDa silk protein 


Chironomus thummi 


494 


2160-47 


precursor HT7 protein 


Gallus gallus 


495 


2161-12 


peritrophin 95 precourser 


Lucilia cuprina 


496 


2161-15 


yk86g11.5 


C. elegans 


497 


2171-12 


51 A surface protein 


Paramecium tetraurelia 


498 


2173-18 


hypothetical - mitochondrial membrane 
transport protein 


Schizosaccharomyces 
pombe 


499 


2087-32 


est sequence 


C. elegans 


500 


2091-19 


Similar to P. aeruginosa hypothetical protein 


C. elegans 


501 


2192-86 


tyrosine kinase 


Drosophila melangaster 


502 


2086-42 


M04B2.4 


C. elegans 


503 


2088-16 


glycoprotein 330 


C. elegans/Human 


504 


2088-39 


EST sequence 


Arabidopsis thaliana 


505 


2088-57 


Yer126cp 


Saccharomyces cereviciae 


506 


2089-25 


similar to S. cereviciae hypothetical 
protein YKL166 


C. elegans 


507 


2090-3 


EST sequence 


Saccharomyces cereviciae 


508 


2090-53 


EST sequence 


C. elegans 


509 


2095-20 


Chloroplast ORF 


Marchantia polymorpha 


510 


2102-28 


similar to S. cerevisiae hypothetical 
protein YKL1 66 


C. elegans 


511 


2102-55 


D1 054.3 


C. elegans 


512 


2102-58 


ZC513.5 gene product 


C. elegans 


513 


2105-44 


E 1087 protein 


Saccharomyces cerevisiae 


514 


2109-24 


F11C1.5 


C. elegans 


515 


• 2154-21 


disulfide-like protein 


Acanthamoeba castellanii 


516 


i 2156-6 


ZK470.1 


C. elegans 


517 


' 2156-18 


BIIIA3 


Ovis aries 


516 


I 2156-27 


AFR1 


S. cereviciae 


51 £ 


I 2165-94 


COS41.8 


Ciona intestinalis 
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520 


2167-65 


EST sequence, function unknown 


C. elegans 


521 


2171-93 


KIAA0160 


Homo sapiens 


522 


2175-45 


ORF YJR83.18 


S. cereviciae 


523 


2185-66 


rps4 


Plasmodium falciparum 


524 


2195-40 


C27C12.4 


C. elegans 


525 


21 96-20 


glycoprotein A 


Pneumocystis carinii 


526 


2205-89 


BKRF1 encodes EBNA-1 protein 


Epstein Ban virus 


527 


2229-19 


D4L 


Variola virus 


528 


2230-35 


KIAA0747 


Homo sapiens 


529 


2231-8 


13 


Mus musculus 


530 


2231-78 


unknown protein 


Arabidopsis thaliana 


531 


2232-49 


Similarity to Yeast hypothetical 52.9 KD 
protein 


C. elegans 


532 


2232-52 


tetratricopeptide repeat protein (tpr2) 


Homo sapiens 


533 


2233-5 


similar to Saccharomyces cerevisiae 
SCD6 protein 


C. elegans 


534 


2233-22 


cDNA EST yk486b9.3 


C. elegans 


535 


2233-93 


CDC27Dm 


D. melanogaster 


536 


2084-34 


mmune suppressorAZ-ATPase 115 kDa 
subunit 


Mus musculus 


537 


2086-30 


V-ATPase A-subunit 


Aedes aegypti 


538 


2087-45 


H+ ATPase 


Drosophila melanogaster 


539 


2088-55 


40-kDa-V-ATPase subunit 


Manduca sexta 


540 


2088-62 


vacuolar ATPase subunit A 


Drosophila melanogaster 


541 


2091-26 


proton-ATPase-like protein 


Homo sapiens 


542 


2091-31 


vacuolar ATPase subunit A 


Drosophila melanogaster 


543 


2092-20 


vacuolar ATPase 115 kDa subunit 


Homo sapiens 


544 


2095-18 


similar to S. cereviciae vacuolar H(+)-ATPase 
54 kD subunit 


C. elegans 


545 


2095-54 


H (+)-transporting ATPase subunit B 


Manduca sexta 


546 


2108-8 


similar to S. cereviciae 54 kDa V-ATPase 
subunit 


C. elegans 


547 


2154-36 


V-ATPase subunit E 


Drosophila melanogaster 


548 


2154-76 


V-ATPase subunit A (new fragment) 


Aedes aegypti 


549 


2166-32 


V-ATPase C subunit 


Drosophila melanogaster 


550 


2166-33 


vacuolar (V-type) H(+)-ATPase B subunit 


Helicoverpa virescens 


551 


2166-90 


beta subunit of ATPase 


Schizaphis graminum 


552 


2161-5 


ATPase 1 


Plasmodium falciparum 


553 


2171-24 


similar to V-ATPase 1 16kd subunit 


C. elegans 


554 


2169-82 


V-ATPase subunit E 


Drosophila melanogaster 


555 


2187-36 


V-ATPase membrane sector associated 
protein M8-9 


Homo sapiens 


556 


2188-91 


V-ATPase subunit A 


Candida tropicalis 


557 


2230-88 


vacuolar ATPase G subunit 


Manduca sexta 


558 


2232-61 


V-ATPase subunit C 


D. melanogaster 


559 


2086-52 


Penelope transposable element ORF 


Drosophila virilis 


560 


2103-2 


genome polyprotein gene product 


Plum pox virus 


561 


2106-8 


pol protein 


Human T-cell lymphotropic 
virus type 2 


562 


2108-41 


reverse transcriptase, Doc retroposon 


Drosophila melanogaster 
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Table II fcont'tf) 



SEQ:(IDIN0:1 








563 


2202-28 


Polyprotein 


Hepatitis virus C 


564 


2165-95 


DNA polymerase 


Choristoneura biennis 
entomopoxvirus 


565 


2169-81 


reverse transcriptase 


Drosophila melanogaster 


566 


2181-36 


reverse transcriptase 


Anopheles gambiae 


1416 


2240-4 


alpha-L-fucosidase precursor 


Homo sapiens 


1417 


2240-11 


estrogen related receptor alpha 


Mus musculus 


1418 


2240-14 


NADH:ubiquinone oxidoreductase 51 -kD 
subunit 


Homo sapiens 


1 zl1 Q 


00 AH 17 


jeniropnin i 


Anopheles gambiae 


1 Aon 




email C*^ "T~ D o <*> /-> rc/^'i K 
blTlcUl O 1 raSc faC i D 


Homo sapiens 


1 AOi 


OOACi 9Q 


oympieKin 


Homo sapiens 


1 A00 


OOACxJOR 


riDosomai protein Lou 


Bos taurus 


1 AO*X 

1 HC.O 


OOACi OQ. 


ouo niDosomai rrotein mkl iua 


Homo sapiens 


1 AO A 


OOACi OQ 


rMiNi / protein 


D. melanogaster 


1 AOtL 


OOACi Q1 


euKaryotic initiation Tactor 4 gamma 


Homo sapiens 


1 A9P 


OOACi QO 


ornttnme aecarooxyiase antizyrne 


0. melanogaster 




OOAf\ AA 


electron transfer f lavoprotein 


Rattus norvegicus 


1 /lOQ 


OOAH CQ 


toi cione 


C. elegans 


•i a on 

1429 


2240-55 


glutathione reductase family 


Musca domestica 


1430 


2240-58 


chymotrypsin-like serine protease 


C. felis 


1431 


2240-63 


ferritin subunit 1 


D. melanogaster 




^240-b4 


vacuolar ATPase subunit B 


D. melanogaster 


1433 


2240-66 


chaperonin containing TCP-1 delta 


Fugu rubripes 


1 434 


2240-70 


1 -acyl-glycerol-3-phosphate acyltransferase 


Zea mays 


1 4oO 


oo/in 71 


to 1 Clone f\L\jd\ lUb 


D. melanogaster 


l HOD 


OOACi 70 


o/oaa long nypotneticai oenyarogenase 


Pyrococcus horikoshii 


1 AO/7 


OOACi 77 


cnymotrypsin-iiKe serine protease 


o. tens 


14^8 




FST rlnnp 




1439 


2240-R3 


phvmntrvn^in-likp Qprinp nrntPPQP 

\*i iyi i i\j\i y ^joii i Mr\c odiiic? iJiVJLCCloC 


H Mia 


1440 


2240-90 


rv/tnrhrnmp P4^0 


n mp/ijnn/TfiJc/pr 


1441 


2240-93 


enhancer-traD-locu^-1 

CI II lul IV-/ v I 11 Ci|k/ IWUUg 1 


hAti^ mi isci ////Q 


1442 


2240-94 


alvcerol-3-DhosDhate dehvdronenasp 














OOAA Q 


ro-n precourser 


uienocepnaiiues reus 




OQ/11 c: 


trypsin-iiKc serine proiease 


Lfienocepnauues reus 




99/11 -7 


rnyuopneroiu. protein 


Ly. meianogasier 


1446 


2241-10 


Sam50 


D. melanogaster 


1447 


2241-12 


NADH dehydrogenase subunit 2 


Chorthippus pa rail el us 


1448 


2241-15 


putative protein 


Arabidopsis thaliana 


1449 


2241-16 


contains EGF-like repeats 


C. elegans 


1450 


2241-20 


Gcapl gene product 


Mus musculus 


1451 


2241-25 


Na(+)-dependent inorganic phosphate 
cotransporter 


D. melanogaster 


1452 


2241-31 


D4L 


Variola virus 


1453 


2241-36 


plenty-of-prolines-101; POP101; SH3-philo- 
protein 


Mus musculus 


1454 


2241-40 


EF-1 -alpha 


D. melanogaster 
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Table II (cont'd) 



FC-6-C1 







«.e u dho K:; niorri Q lQgyj;it:k'i rr^yv^i^iv-y. ■ ; 




1455 


2241-44 

&£.t 1 tt 


n nir oyrunase epsiion-suounil 


ipom063 D3T3T3S 


1 *tJU 




iiuoouiiiai protein 0^0 


Homo sapiens 


1457 


9241-55 


V-hnv nrAtain 

t "UUa protein 


D. melanogaster 


1 *tJO 


9941 -5R 


biiun-criain aiconoi oenyarogenase 


Homo sapiens 


I *TUU 


9941 -5Q 

ttt 1 Ui7 


out Ha 11 ib 0 cysteine ri en repeats 


C. elegans 


1460 


2241-60 

Cbt 1 UU 


Miuoum lypti pnu5pnuTruciOKina.se 


Canis familiaris 


1461 


2241-61 


-Ipat chnplf nrntoin PO 


Mus musculus 


1462 


2241-65 


phv/mntrvncin-lil^o nrntoaco 
ui lyi 1 lun yuoii i-iifte pruiodoe 


o. reus 


1463 


2241-66 

C£-*T 1 UU 


r^ilinncnr'phci n/ltra nof ora eo eiiKi mit 
wiiyuocLiti/f idl yiu dl lot ci doc oUOUnil 


D. melanogaster 


1 tut 


9941-70 


— QT" rlnno 
L—O 1 UlUl ic 


D. melanogaster 


1 4fio" 

1 tUJ 


9941-79 


diieu axon connections proiein 


D. melanogaster 


i too 


9941-74 


cnoiaoe 


Hymenolepis diminuta 


1467 


2241-78 


multiple exostosis 2 protein 


Mus musculus 


1468 


2241-80 


Protein on Ecdysone Puffs 


D. melanogaster 


1469 


2241-82 


paramyosin 


D. melanogaster 


1470 


2241-83 


beta-tubulin 


Bombyx mori 


1471 


2241-84 


natural killer cell enhancing factor 


Cyprinus carpio 


1472 


2241-86 


similar to MYOTUBULARIN-RELATED 
PROTEIN 


Homo sapiens 


1 4/u 


OO/H Q7 


Renin 


Rattus norvegicus 


1 A7A 


oo/h on 
-yu 


Myopniim 


Echinococcus 
mui itiocuia ns 










1 47^ 


994^-1 0 


aipna-acunin 


1 

D. melanogater 


1 47R 




monocarooxyiaie iransponer 


Homo sapiens 


1 477 


994*3-1 Q, 


yKc.f oai U.O 


C. elegans 




OOAO.1 C 


seienium aonor protein 


Homo sapiens 


1 47Q 


0 

t^'fO- i 0 


aceiyi-ooM syninetase 


D. melanogater 




994^-90 


cyiocnrome r'rou UYricAo 


Musca domestica 


14R1 




iNMun uenyaroyenase suDunu 4 


Anopheles arabiensis 


14R9 


994^-97 


Pol\/i thiol lifin 
r UiyUUIU^UIUi 1 


f * **f ^\ ^\^t till ft 9*1 t 

L/ncewius g rise us 


14R^ 

1 *TUU 


994*3-98 

Cbtu bU 


IVIUUoll 1 


l/. meianogaier 


1484 

i tut 


994^-^1 


nrntoin 


oomuyx manuanna 


i *tUJ 


994^-^9 


^pp9*^ nrntoin 


no mo sapiens 


1 48fi 
1 tuu 


994^-^7 


tn mnsatorl nrntoin 
llUIludlcU piuiciii 


o. cereviciae 


1 4R7 
i to/ 


994^-^8 


1 1 UJcOUf \ 


D. melanogater 


1488 


2243-39 


Unknown 


Homo sapiens 


1489 


2243-41 


similar to enoyl-CoA hydratase 


C. elegans 


1490 


2243-45 


similar to dehydrogenase 


C. elegans 


1491 


2243-46 


trypsin-like serine protease 


C. felis 


1492 


2243-48 


Merlin 


Rattus norvegicus 


1493 


2243-52 


GTP-specific succinyl-CoA synthetase beta 
subunit 


Homo sapiens 


1494 


2243-53 


sod Drotein (suDeroxide dismutase^ 


DrosoDhila virili^ 

LSI UJUpl I lid VII Hi J 


1495 


2243-54 


trypsin-like serine protease 


C. felis 


1496 


2243-61 


chymotrypsin-like serine protease 


C. felis 


1497 


2243-66 


TagB 


Dictyostelium discoideum 


1498 


2243-67 


hypothetical protien 


Arabidopsis thaliana 


1499 


2243-68 


heat shock cognate protein 70 


Trichoplusia ni 
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Table II (cont'd) 



FC-6-C1 



SEQ-ID-NQ: 



1500 
1501 



1502 2243-86 



1503 2243-87 



1504 2243-91 sodium/iodide s vmporter 

•i CftC nrtAn rtr» /M-if-« ~ 



1505 



1506 
1507 
1508 



2243-92 



2243- 94 

2244- 12 
2244-19 



2243-72 TRIP-1 homologue 

2243-73 cytosolic NADP-dependent isocitrate 
dehydrogenase 



progesterone-induced protein 



Bmsqd-2 



ORF2 



lysosomal beta-galactosidase 
tropomyosin isoform 127 
KIAA0181 



P. melanogater 
Microtis mexicanis 



Oryctolagus cuniculus 



Bombyx mori 



Homo sapiens 



_ Acidianus ambivalens 



Felis cattus 
P. melanogaster 
Homo sapiens 



2244-23 



151 



2244-29 



plasma membrane calcium ATPase isoform 1 



NADH dehydrogenase 



Homo sapiens 



Bos taurus 



151 
1512 



2244-44 



2244-54 



glutamate dehydrogenase 



spliceosomal protein 



P. melanogaster 



P. melanogaster 



151 
151 



2244-59 
2244-61 



ciliary body glutathione peroxidase 



Bos taurus 



pyridoxal-phoshate-dependent 
aminotransferases 



C. elegans 



2244-64 



Unknown 



1516 



2244-69 



trypsin-like serine protease 



Rattus norvegicus 



C. felis 



1517 



2244-71 



peritrophin 1 



Anopheles gambiae 



1518 



2244-75 



NADH dehydrogenase subunit 5 



Anopheles gambiae 



1519 



2244-84 



microsomal epoxide hydrolase 



Rattus norvegicus 



1520 



2244-86 



C54G7.2 gene product 



C. elegans 



1521 



2244-91 



Aminopeptidase N 



Plutella xylostella 



1522 



2253-2 



cytochrome C oxidase 



1523 



2253-13 



nitiation factor 5A 



H. sapiens 



Gallus gallus 



1524 



2253-14 



1525 



2253-16 



protein phos phatase type 2A catalytic subunit 



myosin light chain 2 



Bos taurus 



P. melanogaster 



1526 



2253-18 



cDNA EST yk462d1.5 



C. elegans 



1527 



2253-19 



ribosomal protein. S10 



H. sapiens 



1528 



2253-24 



aspartvKasp araginyQbeta-hydroxylase, HAAH 



H. sapiens 



1529 



2253-27 



arval and adult myosin heavy chain 



P. melanogaster 



1530 



2253-33 



nervous system antigen 2 



P. melanogaster 



1531 



2253-36 



J366N23.2 



H. sapiens 



1532 



2253-40 



hrp48.1 



P. melanogaster 



1533 



2253-42 



ZnT-1 



Mus musculus 



1534 



2253-43 



aminopeptidase N 



Manduca sexta 



1535 



2253-56 



rofilin 



P. melanogaster 



1536 
1537 



2253-59 



26A5. 



H. sapiens 



2253-68 



ADH-ubiquinone oxidoreductase 42 kDa 
subunit 



P. melanogaster 



1538 



2253-78 



ycine-rich protein 



1539 



2253-81 



■nucleotidase 



H. sapiens 



1540 



2253-86 



utathione S-transferase 



Anopheles gambiae 



1541 



2253-87 



ferritin subunit 1 



P. melanogaster 



1542 



2253-92 



myosin light chain 2 



P. melanogaster 



1543 
1544 



2253-94 



xylose-proton symport 



E. coli 



2254-4 



lature-parasite-infected erythrocyte surface 
antigen 



P. falciparum 
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Table II (cont'd) 



SEQ IDaO: 


NamS| : •■..';.;( 


SenBSnfcft 




1545 


2254-6 


"o-ATP synthase subunit b 


0. melanogaster 


1546 


2254-13 


similar to Arabidopsis thaliana male sterility 
Drotein 2 


C. elegans 


1547 


2254-17 


3LN3 protein 


H. sapiens 


1548 


2254-21 1 


/bgG 


B. subtilis 


1549 


2254-25 


Deroxisomal protein 


Synechocystis sp 


1550 


2254-27 


Slutaminase 


Rattus norvegicus 


1551 


2254-30 


tartan protein 


D. melanogaster 


1552 


2254-33 


eucine zipper-EF-hand containing 
transmembrane protein 1 


H. sapiens 


1553 


2254-39 


similar to helicase 


C. elegans 


1554 


2254-43 


muscle myosin heavy chain 


D. melanogaster 


1555 


2254-45 


putative nicotinate phosphoribosyltransferase 


N. tabacum 


1556 


2254-51 


60S ribosomal protein 


Mus musculus 


1557 


2254-54 


small nuclear riboprotein Sm-D 


H. sapiens 


1558 


2254-55 


nucleoside diphosphate kinase 


Salmo salar 


1559 


2254-60 


serine protease 


C. felis 


1560 


2254-63 


myospheroid protein 


D. melanogaster 


1561 


2254-65 


Carboxylesterase 


Anisopteromalus 
calandrae 


1562 


2254-66 


siah binding protein 1 


H. sapiens 


1563 


2254-70 


vacuolar ATPase, subunit M9.7 


Manduca sexta 


1564 


2254-83 


Fumarylacetoacetate hydrolase 


Rattus norvegicus 


1565 


2254-84 


metalloproteinase 1 


Hydra vulgaris 


1566 


2254-88 


alpha-spectrin 


D. melanogaster 


1567 


2254-93 


NADH dehydrogenase subunit 6 


D. melanogaster 


1568 


2254-96 


cyclophilin isoform 5 


C. elegans 


1569 


2255-5 


similar to mitochondrial ATPase inhibitors 


C. elegans 


1570 


2255-8 


yk391f12.5 


C. elegans 


1571 


2255-12 


Unknown 


H. sapiens 


1572 


2255-17 


ribonucleotide reductase subunit M1 


M. musculus 


1573 


2255-19 


docking protein 


H. sapiens 


1574 


2255-22 


Similar to rat 5E5 antigen 


H. sapiens 


1575 


2255-23 


ribosomal protein S31 


D., melanogaster 


1576 


2255-25 


Similar to acyl-CoA dehydrogenase 


C. elegans 


1577 


2255-28 


Arginine tyrosine kinase 


H. sapiens 


1578 


2255-32 


ribosomal protein L7a 


D., melanogaster 


1579 


2255-33 


chS-Rex-s 


G. gallus 


1580 


2255-39 


Phosphoacetylglucosamine mutase 


C. elegans 


1581 


2255-41 


k 1 A r*\ If 1 1 1 1 'in 

NADH dehydrogenase subunit 6 


D t/ melanogaster 


1582 


2255-45 


x ft. 1 A It * tlx 

tRNA-glutamine synthetase 


C. elegans 


1583 


2255-46 


p68 


M. musculus 


1584 


2255-49 


ABC8 


M. musculus 


A CQC 

looc 




Kynurenme aminoxransrerase 


h. rauus 


1586 


2255-51 


SmD homolog {Gly-Arg repeat} 


M. musculus 


1587 


r 2255-56 


epoxide hydrolase 


S. scrofa 


158£ 


2255-60 


Sec23 protein 


H. sapiens 


158E 


I 2255-62 


HMG CoA synthase 


M. musculus 


159C 


) 2255-63 


dipeptidyl aminopeptidase-like protein 6 


M. musculus 
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Table II (cont'd) 



SEQ;JD*NQ:^ 


NamejM* 


GenBan&Ho.mology^;^ 




A ro A 

1591 


oocr f> o 

2255-66 


retinal rod Na+/Ca+, K+ exchanger 


H. sapiens 


•4 COO 

1592 


2255-67 


4-hydroxybutyrate coenzyme A transferase 


C. elegans 


■i coo 


OOCC 7A 


iD54+ins2 isoform 


H. sapiens 


1594 


2255-73 


chromaffin granule ATPase II homolog 


M. musculus 


1595 


2255-77 


40S ribosomal protein S10 


H. sapiens 


1596 


2255-79 


34/67 kD laminin binding protein 


S. purpuratus 


1597 


2255-82 


RNA-binding protein lark 


D., melanogaster 


1598 


2255-86 


thiol-specific antioxidant protein 


ft norvegicus 


1599 


2256-7 


Similar to Human estrogen-responsive finger 
protein 


H. sapiens 


1600 


2256-1 1 


Trypsin 


C. felis 


1601 


2256-12 


CEV14 


H. sapiens 


1602 


2256-16 


AL021475 


C. elegans 


1603 


2256-21 


Heterogenous Nuclear Ribonucleoprotein C1 


H. sapiens 


1604 


2256-22 


b4 integrin interactor 


H. sapiens 


1605 


2256-28 


testis enhanced gene transcript protein 


H. sapiens 


1606 


2256-31 


synaptic vesicle protein 2B 


ft norvegicus 


1607 


2256-40 


TNF-alpha stimulated ABC protein 


H. sapiens 


1608 


nnp o ji o 

2256-42 


carboxypeptidase A 


H. armigera 


1609 


2256-46 


pherophorin S 


V. carteri 


1610 


ooco c*o 

2256-52 


Fo-ATP synthase subunit b 


D. melanogaster 


1611 


2256-54 


PDGF associated protein 


H. sapiens 


1612 


OO f" O CO 

2256-58 


S20 ribosomal protein 


D. melanogaster 


A o m n 

1613 


2256-64 


ribosomal protein S9 


ft sapiens 


A C A A 

1614 


2256-69 


elongation factor 1 -gamma 


Artemia sp 


1615 


OOCO "TO 

2256-70 


conserved hypothetical protein 


S. pombe 


A o A o 

1616 


OOCO "TO 

2256-72 


fructose 1,6 bisphosphate-aldolase 4C 


D, melanogaster 


1617 


2256-73 


troponin-T 


D. melanogaster 


1618 


2256-80 


SRP14 


C. familiaris 


1619 


2256-82 


succinyl-CoA synthetase alpha subunit 


S. scrofa 


1620 


2256-89 


Csa-19 


ft sapiens 


1621 


2256-92 


Sacm21 


M. musculus 


1622 


2256-94 


apoptosis inhibitor 


Cydia pomonella 
granulosis virus 


1623 


2256-96 


ribosomal protein L22 


D. melanogaster 
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FC-6-C1 

Table m represents a variety of flea HNC nucleic acid molecules of the present 
invention. 



Table m 



SlQIdNO:' 






$EQ ID N)(5: 


^Sliiiilillli 


567 


2096-1 9NB.HNC 




601 


S*K J J j\ A A ft | I I ft ■ 

21 16-24NB.HNC 


568 


2096-25NB.HNC 




602 


2116-27NB.HNC 


569 


2096-48NB.HNC 




603 


21 16-41 NB.HNC 


570 


2096-50NB.HNC 




604 


2116-59NB.HNC 


571 


2096-52NB.HNC 




605 


2116-64NB.HNC 


572 


2096-55NB.HNC 




606 


2117-05NB.HNC 


573 


2097-09NB.HNC 




607 


2117-09NB.HNC 


574 


2097-1 5NB.HNC 




608 


2117-11NB.HNC 


575 


2097-20NB.HNC 




609 


2117-53NB.HNC 


576 


2097-22NB.HNC 




610 


2118-03NB.HNC 


577 


2097-32NB.HNC 




611 


2122-39NB.HNC 


578 


2097-45NB.HNC 




612 


2123-25NB.HNC 


579 


2097-46NB.HNC 




613 


2124-40NB.HNC 


580 


2097-47NB.HNC 




614 


2124-62NB.HNC 


581 


2097-56NB.HNC 




615 


o H n H nnKin 1 IK I /~\ 

2131-22NB.HNC 


582 


2097-64NB.HNC 




OHO 

616 


OHOH nnMn UKIA 

2131-32NB.HNC 


583 


2098-04NB.HNC 




o h -7 

617 


oh oo h rMn Li K. 1 /"> 

2132-15NB.HNC 


584 


2098-40NB.HNC 




OHO 

618 


oh oo oomd UMr* 

2132-28NB.HNC 


585 


2098-43NB.HNC 




619 


OH OO POMD 1 JM /"> 

2132-63NB.HNC 


586 


2099-9NB.HNC 




620 


OH OO HMD L 1 K 1 ^> 

2132-9NB.HNC 


587 


2100-10NB.HNC 




621 


0^07 HOMO UMP 

21 37-1 9ND.HNU 


coo 
boo 


OHnn ylCMD UMP 

21 00-45Nb.nNO 




\JCC- 




589 


2100-47NB.HNC 




623 


2138-05NB.HNC 


590 


2100-56NB.HNC 




624 


2138-51NB.HNC 


591 


2100-63NB.HNC 




625 


21 39-31 NB.HNC 


592 


211 0-41 NB.HNC 




626 


2139-41 NB.HNC 


593 


2110-53NB.HNC 




627 


2139-60NB.HNC 


594 


2112-12NB.HNC 




628 


2140-13NB.HNC 


595 


2112-35NB.HNC 




629 


2140-15NB.HNC 


596 


2113-17NB.HNC 




630 


2140-18NB.HNC 


597 


2115-16NB.HNC 




631 


2140-54NB.HNC 


598 


2115-22NB.HNC 




632 


2141-16NB.HNC 


599 


2115-3NB.HNC 




633 


2141-59NB.HNC 


600 


2116-19NB.HNC 




634 


2142-16NB.HNC 
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Table HI (cont'cD 



FC-6-C1 



SE&ll$l! 




635 


2142-18NB.HNC 


coe 
DOD 


2143-06NB.HNC 


COT 

537 


OH ylO 0~7MO UMO 

2143-07NB.HNC 


coo 

638 


2143-33NB.HNC 


639 


2143-54NB.HNC 


O >1 O. 

640 


OHPO O^ktD i IK IP 

2168-06ND.HIMC 


641 


OHOf* nnMH LJMA 

2168-09NB.HNC 


642 


oH£>r» AnKtn ijm/^ 

2168-42NB.HNC 


643 


2168-79NB.HNC 


C A A 

644 


oh on nnMn 1 ikio 

2168-82NB.HNC 


645 


2170-04NB.HNC 


646 


OHTO rtnMn 1 IMO 

2170-08NB.HNC 


647 


2170-82NB.HNC 


648 


2172-39NB.HNC 


649 


OHTPO rnMD 1 1 k 1 

2172-59NB.HNC 


C r\ 

650 


oh*70 /*» 0 k 1 n 1 1 K 1 

2172-60NB.HNC 


651 


OH"70 TTlkin 1 1 K 1 /"N 

2172-77NB.HNC 


652 


2174-14NB.HNC 


- 653 


2174-17NB.HNC 


654 


nA*rrA j j 1 in t 1 k I ^\ 

2174-41NB.HNC 


655 


2174-49NB.HNC 


0 cc 

656 


Aj*7i rAMn 

2174-59NB.HNC 


657 


r> a —r a pnKin MM/% 

2174-68NB.HNC 


658 


O H ~70 OH KID MKir» 

21 76-21 NB.HNC 


659 


OH7C O /I K 1 1"~> l_J MP 

2176-34NB.HNC 


bbU 


2176-47NB.HNO 


bb 1 


OH7C CCMD LJM/"* 


ceo 
bb*i 


OH7C COMD UMO 


bbo 


OH7C COMD 1 iKir> 


Db4 


OH"7C C/1KID LJMf* 


665 


OH7C oriin LJKIO 

2176-65NB.HNU 


ODD 


£. \ ( D"/ OIND.fUNvy 


667 


2178-05NB.HNC 


668 


2178-13NB.HNC 


669 


2178-23NB.HNC 


670 


2178-25NB.HNC 


671 


21 78-41 NB.HNC 


672 


2178-56NB.HNC 


673 


2178-57NB.HNC 







674 


2178-58NB.HNC 


675 


2178-67NB.HNC 


676 


2178-72NB.HNC 


677 


2178-78NB.HNC 


678 


2178-80NB.HNC 


679 


2178-90NB.HNC 


680 


2178-91NB.HNC 


681 


2178-95NB.HNC 


682 


2180-05NB.HNC 


683 


2180-18NB.HNC 


684 


2180-20NB.HNC 


685 


2180-32NB.HNC 


686 


2180-59NB.HNC 


687 


2180-62NB.HNC 


688 


2180-74NB.HNC 


689 


2180-78NB.HNC 


690 


2180-79NB.HNC 


691 


2180-88NB.HNC 


692 


2180-90NB.HNC 


693 


2182-07NB.HNC 


694 


2182-12NB.HNC 


695 


2182-13NB.HNC 


696 


2182-27NB.HNC 


697 


2182-2NB.HNC 


698 


2182-46NB.HNC 


699 


2182-55NB.HNC 


700 


2182-57NB.HNC 


701 


2182-63NB.HNC 


702 


2182-64NB.HNC 


703 


2182-83NB.HNC 


704 


2182-86NB.HNC 


"70C 

705 


OH OO OOMD L1MO 

2182-88NB.HNC 


706 


2182-90NB.HNC 


707 


2182-92NB.HNC 


708 


2182-94NB.HNC 


709 


2184-15NB.HNC 


710 


2184-37NB.HNC 


711 


2184-65NB.HNC 


712 


2186-14NB.HNC 
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Table lit fcont'dt 



FC-6-C1 



SEQID.NQ: 


Name ^rtfeiiiSllii 


719 


91flfi.4^MR UMP 
d \ OD-'fOIND.niNo 


714. 


91RR.c:nMn UMP 
d I OD-OUIND.rlNL/ 


71 ^ 
/ I o 


91ftfi KOMR UMP 


71 R 


91ftfi CflMR UMP 
d I OD-DUiND.nlNO 


717 


91ftfi R9MR UMP 


71 fl 


91ftfi RQMR UMP 
d \ OO-DOlND.niMO 


71 Q 

/ i y 


91ftfi CQMD UMP 
d I OO-DOlND.nlNO 


yon 

/ d\J 


91Q£ CQMD LIMP 


791 
/ d I 


991 1 1 OMR UMP 


790 

/ dd 


991 1 OQMR UMP 
dd \ I -^OlND.niNO 


/ dO 


9911 9QMD UMP 


~7DA 

f dH 


991 1 QfiMR UMP 
dd I l -OUIND.nlNLf 


79 e; 


99*1 i /I'OMD LIMP 

ddi \ -4oNd.MNo 


79 ft 


991 1 COMD UMP 


797 
( df 


99"M C/1MD UMP 


79P 


9919 OHMD UMP 
dd 1^-OVJIND.nlNL/ 


79Q 

/ <£y 


9919 OHMD UMP 

dd\ d-o i iNb.nlNO 


7*3 f"\ 
/ OU 


OO-fO 7HMD UMP 


7Q1 
/ Ol 


99-19 70MD UMP 


7Q9 
/ Od 


9919 70MD UMP 
dd\d~l oINtD.nNO 


799 


9919 Q1MD UMP 




9919 QCMR UMP 


79^ 
/ OO 


9919 ft7MR UMP 


79fi 
/ OO 


9919-Q1MR UMP 


797 

/ Of 


9919-QRMR UMP 


79ft 

/ OO 


9919-QMR UMP 


79Q 
/ Oi? 


991*5 nflMR UMP 
dd I O-UOlND.niNLr 


7Afl 
f *f VJ 


991*5 flQMR UMP 

dd i o-uyiND.niNO 


7/11 


991 9-1 1 MR UMP 




9919-19MR UMP 


749 

/ *fO 


9919-1AMR UMP 


744 


2213-34NB HNC 


745 


2213-53NB.HNC 


746 


2213-58NB.HNC 


747 


2213-67NB.HNC 


748 


2213-79NB.HNC 


749 


2214-02NB.HNC 


750 


2214-03NB.HNC 


751 


2214-05NB.HNC 







752 


2214-07NB.HNC 


753 


2214-15NB.HNC 


754 


2214-23NB.HNC 


755 


2214-30NB.HNC 


756 


2214-36NB.HNC 


757 


2214-37NB.HNC 


758 


2214-40NB.HNC 


759 


2214-43NB.HNC 


760 


2214-53NB.HNC 


761 


2214-57NB.HNC 


762 


2214-60NB.HNC 


763 


2214-61NB.HNC 


764 


2214-73NB.HNC 


765 


2214-76NB.HNC 


766 


2214-80NB.HNC 


767 


2215-07NB.HNC 


768 


2215-15NB.HNC 


769 


2215-31 NB.HNC 


770 


2215-41 NB.HNC . 


771 


2215-51 NB.HNC 


772 


2215-80NB.HNC 


773 


f\ A -A ^ ^ ft t ^\ 1 1 ft 1 

2215-85NB.HNC 


774 


221 5-91 NB.HNC 


775 


2217-14NB.HNC 


776 


2217-16NB.HNC 


1 J v 1 7 
111 


2217-33NB.HNC 


—7-7 0 

778 


2217-39NB.HNC 


779 


2217-78NB.HNC 


780 


2217-92NB.HNC 


781 


2218-15NB.HNC 


"7 O O 

782 


2218-19NB.HNC 


/ 00 


991Q OfiMR UMP 
dd\ CJ-^OlND.nlNO 


784 


2218-36NB.HNC 


785 


2218-41 NB.HNC 


786 


2218-56NB.HNC 


787 


2218-58NB.HNC 


788 


2218-69NB.HNC 


789 


2218-71 NB.HNC 


790 


2218-76NB.HNC 



Table III (cont'd^ 





IN a m e^i^-vf^ 


791 


2218-77NB HNC 


792 


2218-84NB HNC 


793 


2218-96NB HNC 


794 


2219-11NB HNC 


795 


2219-13NB HNC 


796 


2219-17NB HNC 


797 


221 9-1 91MB HNC 


798 


2219-20NB HNC 


799 


2219-22NB HNC 


800 


2219-23NR HNC 


801 


2219-32NB.HNC 


802 


2219-45NB.HNC 


803 


2219-49NB.HNC 


804 


221 9-51 NB.HNC 


805 


2219-72NB.HNC 


806 


2219-80NB.HNC 


807 


2219-952122- 




39NB HNIC 




2220-02NR HNIC 


808 


2220-02NB HNIC 

C~f—C—\J \JC\ MLJ.I II NW 


809 


2220-27NB HNC 


810 


2220-3PNR HNC 


811 


2220-53NB HNC 


812 


2220-60NB HNC 


813 


2220-66NB HNC 


814 


2221-06NB HNC 


815 


2221-15NB HNC 


816 


2221-18NB HNC 


817 


2221-20NB HNC 


818 


2221-24NB HNC 


819 


2221-45NB HNC 


820 


2221-46NB HNC 


821 


2221-48NB HNC 


822 


2221-54NB.HNC 


823 


2221-55NB.HNC 


824 


2221-59NB.HNC 


825 


2221 -61 NB.HNC 


826 


2221-62NB.HNC 


827 


2221-70NB.HNC 


828 


2221-86NB.HNC 



FC-6-C1 





IN a m e ^m^i r - ws^. 


829 


2221-87NB.HNC 


830 


2221-95NB.HNC 


831 


2223u-18NB.HNC 


832 


2223U-22NB.HNC 


833 


2223U-23NB.HNC 


or* a 

834 


2223U-31 NB.HNC 


835 


2223u-33NB.HNC 


836 


2223U-36NB.HNC 


837 


2223u-67NB.HNC 


838 


o o o o . . opMn i u 

2223U-85NB.HNC 


839 


2224U-05NB.HNC 


840 


2224U-07NB.HNC 


841 


OOO A . ■ 4 AKin 1 IM/N 

2224u.-10NB.HNC 


842 


ooo a . . j j Kin i iv ir« 

2224U-11 NB.HNC 


843 


OOO A . . <l rKin 1 

2224U-15NB.HNC 


O A A 

844 


ooo a . . nrkin i it ir\ 

2224u-25NB.HNC 


845 


2224u-27NB.HNC 


846 


2224U-44NB.HNC 


o A~r 

847 


2224U-52NB.HNC 


848 


OOO A . . o O R 1 1 - 1 1 ||l |r« 

2224U-62NB.HNC 


849 


2224U-70NB.HNC 


OCA 

850 


2224U-71 NB.HNC 


851 


2224U-79NB.HNC 


852 


2225U-11 NB.HNC 


853 


o o o r* . . o o iv f n i \ k \ 

2225U-20NB.HNC 


854 


2225U-23NB.HNC 


855 


2225U-28NB.HNC 


856 


2225U-55NB.HNC 


857 


2225U-59NB.HNC 


858 


2225U-64NB.HNC 


859 


2225U-77NB.HNC 


860 


2225U-95NB.HNC 


861 


2226-932122- 
39NB.HNC 


862 


2226U-07NB.HNC 


863 


2226U-19NB.HNC 


864 


2226U-39NB.HNC 


865 


2226U-45NB.HNC 


866 


2226U-49NB.HNC 
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Table III (cont'd) 





! Naffillllliilil 


867 


' 2226U-54NB.HNC 


868 


2226U-71NB.HNC 


869 


2226U-77NB.HNC 


870 


2226U-83NB.HNC 


871 


2226U-91NB.HNC 


872 


2227U-12NB.HNC 


873 


2227U-13NB.HNC 


874 


2227U-23NB.HNC 


875 


2227U-26NB.HNC 


876 


2227U-30NB.HNC 


877 


2227U-31NB.HNC 


878 


2227u-33NB.HNC 


879 


2227U-43NB.HNC 


880 


2227U-51NB.HNC 


881 


2227U-60NB.HNC 


882 


2227U-93NB.HNC 


883 


2228U-04NB.HNC 


884 


2228U-09NB.HNC 


885 


2228U-12NB.HNC 


886 


2228U-21NB.HNC 


887 


2228U-26NB.HNC 


888 


2228U-49NB.HNC 


889 


2228u-'54NR HMP 


onn 
oyu 


oooOn ccMn i iilia 

2228U-55NB.HNC 


891 


2228U-61NB.HNC 


892 


2228U-65NB.HNC 


893 


2228U-79NB.HNC 


894 


2228U-90NB.HNC 


1624 


2222-7 


1625 


2222-16 


1626 


2222-19 


1627 


2222-39 


1628 


2222-56 


1629 


2222-59 


1630 


2222-79 


1631 


2222-89 


1632 


2228-4 


1633 


2228-9 


1634 


2228-12 


1635 


2228-21 


1636 


2228-26 


1637 


2228-49 



bLU ID NO: 




1638 


f 2228-54 


163S 


> 2228-61 


164C 


2228-65 


1641 


2228-79 


1642 


2228-90 


1643 


2245-5 


1644 


2245-7 


1645 


2245-15 


1646 


2245-16 


1647 


2245-17 


1648 


2245-20 


1649 


2245-35 


1650 


2245-38 


1651 


2245-39 


1652 


2245-51 


1653 


2245-52 


1654 


2245-57 


1655 


2246-13 


1656 


2246-19 


1657 


2246-25 


1658 


2246-27 


1659 


2246-29 


1660 


2246-40 


1661 


2246-45 


1662 


2246-52 


1663 


2246-64 


1664 


2246-66 


1665 


2246-74 


1666 


2246-82 


1667 


2247-6 


1668 


2247-17 


1669 


2247-29 


1 670 


2247-31 


A 0*74 

1 671 


2247-36 


1 672 


2247-40 


1 673 


2247-46 


1 674 


2247-50 


I D/O 


2247-54 


1676 


2247-63 


1677 


2247-66 


1678 


2247-68 


1679 


2247-69 


1680 


2247-81 


1681 


2247-82 


1682 


2247-95 


1683 


2248-7 
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Table III fcont'dl 



FC-6-C1 



D 

w 

W 

a 





1684 


2248-18 


1685 


2248-32 


1686 


2248-41 


1687 


2248-50 


1688 


2248-54 


1689 


2248-60 


1690 


2248-62 


1691 


2248-65 


1692 


2248-86 


1693 


2248-94 


1694 


2249-6 


1695 


2249-30 


1696 


2249-35 


1697 


2249-36 


1698 


2249-68 


1699 


2249-74 


1700 


2249-79 


1701 


2250-20 


1702 


2250-24 


1703 


2251-7 


1704 


2251-21 


1705 


2251-25 


1706 


2251-38 


1707 


2251-58 


1708 


2252-7 


1709 


2252-15 


1710 


2252-19 


1711 


2252-24 


1712 


2252-26 


1713 


2252-27 


1714 


2252-32 


1715 


2252-36 


1716 


2252-37 


1717 


2252-69 


1718 


2252-78 
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# 



FC-6-G1 



Table IV represents a variety of flea HMT nucleic acid molecules of the present 
invention. 



Table IV 



G 

n 

a 

H 

m 

a 



OCU IU iNU, 


Name ; /-Ss!^ 




* OHQ/1 HO U K ATK 1 D 


Oc?C 


> ^Uo4-Uo.nM 1 Nd 




' OHQA f\~7 UIKATMO 

^Uo4-U/.nM 1 Nd 


oyc 


*:Uo4-Uy\HM INB 


ftQC 

OSS 


OClQA -1C U K ATK I D 

<:Uo4- 1 o.HMTNB 


yuu 


^Uo4-i7.HMTNB 


yu i 


^Uo4-1 o.HMTNB 


one 
yu^ 


*:Uo4-21. HMTNB 


yuo 


*£0o4-22. HMTNB 


QH/l 

yu*i 


<:Uo4-30. HMTNB 


ant; 


£Uo4-33. HMTNB 


yuo 


<:Uo4-3o. HMTNB 


Q/V7 

yu/ 


£U84-37. HMTNB 


yuo 


^Uo4-3o\HMTNB 


yuy 


On.Q/1 Oft LIKJITMO 

£l)o4-39. HMTNB 


01 n 
y t u 


^084-43.HMTNB 


y i 


£Uo4-bU. HMTNB 


y t £ 


<£Uo4-54. HMTNB 


y 1 0 


OClQA EC UMTMD 

^Uo4-5o. HMTNB 


y i «+ 


OHO/ CO LJ |\ ATKtd 

*iUo4-by.HM I NB 


Q1 ^ 

v7 I U 


tUoo-Uo.HM 1 No 


y 1 0 


£iUoo- 10.MIVI 1 Nd 


Q17 


cUoo-oo.HM 1 Nd 


Q1 ft 


tUoo-oo.nlvl 1 Nd 




^uoo-oy.nivi 1 Nd 


qpo 


tuoo-4y.nivi 1 Nd 


Q21 


tuoo"Oo.n(vi \ nd 


92P 


POft^-Rft WMTMQ 

fcuoo-oo.nivi 1 nd 




tuoo-o i .MM I Nb 


924 


2086-05.HMTNB 


925 


2086-1 O.HMTNB 


926 


2086-1 3.HMTNB 


927 


2086-1 5.HMTNB 


928 


2086-20.HMTNB 


929 


2086-25.HMTNB 


930 


2086-32.HMTNB 


931 


2086-33.HMTNB 


932 


2086-34.HMTNB 


933 


2086-37.HMTNB 


934 


2086-41. HMTNB 


935 


2086-43.HMTNB 





936 


i 2086-44.HMTNB 


937 


' 2086-54.HMTNB 


93G 


I 2086-55.HMTNB 


93S 


2086-58.HMTNB 


94C 


2087-09.HMTNB 


941 


2087-1 7.HMTNB 


942 


2087-28.HMTNB 


943 


2087-33.HMTNB 


944 


2087-35.HMTNB 


945 


2087-51 .HMTNB 


946 


2087-54.HMTNB 


947 


2088-07.HMTNB 


948 


2088-1 7.HMTNB 


949 


2088-35.HMTNB 


950 


2088-52.HMTNB 


951 


2088-59.HMTNB 


952 


2089-1 2.HMTNB 


953 


2089-1 4.HMTNB 


954 


2089-33.HMTNB 


955 


2089-36.HMTNB 


956 


2089-51. HMTNB 


957 


2089-60.HMTNB 


958 


2090-11. HMTNB 


959 


2090-27.HMTNB 


960 


2090-33.HMTNB 


961 


2090-44.HMTNB 


962 


2090-57.HMTNB 


963 




964 


2091-22.HMTNB 


965 


2091-23.HMTNB 


966 


2091-35.HMTNB 


967 


2091-63.HMTNB 


968 


2092-1 1 .HMTNB 


969 


2092-1 6.HMTNB 


970 


2092-40.HMTNB 


971 


2092-42.HMTNB 


972 


2092-46.HMTNB 


973 


2092-60.HMTNB 


974 


2093-20.HMTNB 


975 


2093-23.HMTNB 


976 


2093-43.HMTNB 
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Table IV (cont'd) 



SEQ ID NOS 


Naiffe^^ 


977 


2093-48 HMTNB 


978 


2093-50 HMTNB 


979 


2093-62. HMTNB 


980 


2093-63. HMTNB 


981 


2094-08. HMTNB 


982 


2094-26 HMTNB 


983 


2094-33 HMTNB 


984 


2094-47.HMTNB 


985 


2094-50.HMTNB 


986 


2094-62.HMTNB 


987 


2095-04.HMTNB 


988 


2095-1 0.HMTNB 


989 


2095-12 HMTNB 


990 


2095-13 HMTNB 


991 


2095-15 HMTNB 

^vvv 1 w.l IIVI 1 1 1L> 


992 


2095-20 HMTNB 


993 


2095-22 HMTNB 

*—\J^J**J W • 1 IIVI 1 1 11 Lai 


994 


2095-31 HMTNB 

t.^wv^ \J I (I IIVI 1 llU 


995 


2095-33 HMTNB 

WWWW V*/ W ft 1 IIVI 1 1 ^1 ^/ 


996 


2095-34 HMTNB 


997 


2095-36 HMTNB 

WWWW WW.I IIVI 1 1 


998 


2095-40 HMTNB 


999 


2095-48 HMTNB 


1000 


2102-12.HMTNB 


1001 


2102-16 HMTNB 


1002 


2102-18 HMTNB 

1 v *— 1 \»* • 1 IIVI 1 1 T| L*^ 


1003 


2102-19 HMTNB 

fc» 1 1 W ■ 1 IIVII |T| \—f 


1004 


2102-20 HMTNB 

* W V/ ft 1 IIVII |^| 


1005 


2102-29 HMTNB 

*•* 1 WW bVS.I IIVI I 1 1U 


1006 


2102-35 HMTNB 

*— ■ ■ \J f— \J W* 1 IIVII IH L>« 


1007 


2102-37. HMTNB 


1008 


2102-38.HMTNB 


1009 


2102-41. HMTNB 


1010 


2102-47.HMTNB 


1011 


2103-02.HMTNB 


1012 


2103-09.HMTNB 


1013 


21 03-45. HMTNB 


1014 


21 03-56. HMTNB 


1015 


2103-58.HMTNB 


1016 


21 04-58. HMTNB 

1 W * Vp** W il IIVII 1^1 


1017 


21 04-60. HMTNB 

■ W T W tl IIVII 1 ^ 


1018 


2104-61 HMTNB 

*— ■ ^ * W 1*1 IIVII |T| LaS 


1019 


2105-02.HMTNB 


1020 


2105-20.HMTNB 


1021 


2105-35.HMTNB 


1022 


2105-42.HMTNB 


1023 


2105-44.HMTNB 


1024 


2106-05.HMTNB 



SEQ^IP;NP:^ 




1025 


2106-27.HMTNB 


1026 


2106-29.HMTNB 


1027 


2106-34.HMTNB 


1028 


2106-48.HMTNB 


1029 


2106-50.HMTNB 


1030 


2106-64.HMTNB 


1031 


2107-02.HMTNB 


1032 


2107-1 0.HMTNB 


1033 


2107-37.HMTNB 


1034 


2108-03.HMTNB 


1035 


2108-23.HMTNB 


1036 


2108-46.HMTNB 


1037 


2108-47.HMTNB 


1038 


2108-48.HMTNB 


1039 


2108-49.HMTNB 


1040 


2108-63.HMTNB 


1041 


2109-04.HMTNB 


1042 


2109-06.HMTNB 


1043 


2109-37.HMTNB 


1044 


2109-38.HMTNB 


1045 


2109-44.HMTNB 


1046 


2154-08.HMTNB 


1047 


2154-09.HMTNB 


1048 


2154-10.HMTNB 


1049 


2154-28.HMTNB 


1050 


2154-30.HMTNB 


1051 


2154-45.HMTNB 


1052 


2154-46.HMTNB 


1053 


2154-61. HMTNB 


1054 


21 54-71 .HMTNB 


1055 


21 54-81 .HMTNB 


1056 


2154-83.HMTNB 


1057 


2156-02.HMTNB 


1058 


2156-06.HMTNB 


1059 


2156-18.HMTNB 


1060 


2156-27.HMTNB 


1061 


2156-43.HMTNB 


1062 


2156-48.HMTNB 


1063 


2156-50.HMTNB 


1064 


2157-16.HMTNB 


1065 


2157-34.HMTNB 


1066 


2157-45.HMTNB 


1067 


2157-70.HMTNB 


1068 


2157-75.HMTNB 


1069 


2157-79.HMTNB 


1070 


2157-86.HMTNB 


1071 


2158-02.HMTNB 


1072 


2158-14.HMTNB 



Table IV (cont'd) 



SEQIDiNO^ 


^ a rri e ■ . :> 


1073 


2158-19.HMTNB 


1074 


2158-22 HMTNB 


1075 


2158-27.HMTNB 


1076 


2158-34.HMTNB 

9 \—J T • 1 |1T|| IV 


1077 


2158-37 HMTNB 


1078 


2158-39.HMTNB 


1079 


2159-07.HMTNB 


1080 


2159-09.HMTNB 


1081 


2159-17.HMTNB 


1082 


2159-34.HMTNB 


1083 


2159-35 HMTNB 


1084 


2159-60 HMTNB 


1085 


2160-16 HMTNB 


1086 


2160-17 HMTNB 

1 WW 1 f .1 1 I V I 1 I1U 


1087 


2160-29 HMTNB 


1088 


2160-30.HMTNB 


1089 


2160-32.HMTNB 


1090 


2160-39.HMTNB 


1091 


2160-49.HMTNB 


1092 


2160-53.HMTNB 


1093 


2160-54.HMTNB 


1094 


2160-55 HMTNB 


1095 


2160-77 HMTNB 

1 V f # » 1 IIVl 1 IV 


1096 


2160-82 HMTNB 

b 1 Vp/£ha • 1 |IVI I 1 V 


1097 


2160-89 HMTNB 

b» 1 WW WWtl IIVl 1 1 1 w 


1098 


2160-91 HMTNB 

fa 1 WW Will IIVl 1 1 1 ki* 


1099 


2161-13 HMTNB 

k_ 1 W 1 1 U>l Mil 1 1 1 fc^ 


1100 


2161-19 HMTNB 

Iwl 1 W • 1 IIVl ■ |t| I—* 


1101 


2161-45 HMTNB 

£■ I i r \*f * i iivi i iv <u«/ 


1102 


2161-57 HMTNB 

1 1 i • 1 IIVl 1 IV buJ 


1103 


2161-60.HMTNB 


1104 


2161-79.HMTNB 


1105 


2161-83.HMTNB 


1106 


2161-90.HMTNB 


1107 


2161-94.HMTNB 


1108 


2162-05.HMTNB 


1109 


2162-12.HMTNB 


1110 


2162-13.HMTNB 


111,1 


2162-18.HMTNB 


1112 


2162-35.HMTNB 


1113 


21 62-41 .HMTNB 


1114 


2162-50.HMTNB 


1115 


2162-59.HMTNB 


me 


2162-63.HMTNB 


1117 


21 62-71 .HMTNB 


111€ 


\ 2162-75.HMTNB 


111£ 


I 2162-78.HMTNB 


112C 


) 2163-07.HMTNB 



OCU 1 U IN VJ . : . 




1191 


P1 fi^-1 1 HMTNR 

£ I DO I I .nivi I IND 


1 19P 

I I ££ 


P1 fi^-1 ft HMTMR 


1 1P*3 


P1fi^-P^ HMTMR 

C I UO CO. 1. IVl 1 \H LJ 


1 1P4 

I I £H 


PIR^-Pc; HMTMR 


1 1PR 
1 1 £\J 


P1R^-49 HMTMR 
c. \ oo *+o.nivi i inD 


1 1PR 
1 1 £\J 


PIR^-^D HMTMR 

C \ DO Ou.niVI I IMD 


1 1P7 

1 \ £f 


916^-61 HMTMR 


1 1P8 


PlfWfi^ HMTMR 

C 1 DO UJ.niVI ! IND 


1 1PQ 

1 1 £\3 


21 6^-7^ HMTMR 

C. \ DO f o.nivi 1 IND 


1 i^n 

1 1 o u 


P1fi^-77 HMTMR 

c. I DO / / .nivl 1 IND 


1 1^1 

1 1 O 1 


91 R9-R7 HMTMR 
c. I DO-O/ .nivi I IND 


1 1*39 

1 1 0£ 


91M-Q9 HMTMR 

c. i Do-yo.nivi i iND 


1 1^^ 


91fi / 3-OE; HMTMR 

d. \ oo-yo.nivi i ind 


1 1<*4 
I I O^ 


c. I OO-UH.niVI I IND 


1 1 *3R 


91 ft^-Oft HMTMR 
c. \ oo-uo.nivi I IND 


I I oo 


91fic:-94 HMTMR 
& \ oo-*c*f .nivi l IND 


1 1*V7 

MO/ 


P1ft c >-4R HMTNR 


1 1 ^ft 
I I OO 


91RR-5Q HMTMR 

£ I Ow'-ww'.niVl I IND 


1 1^Q 

1 1 Ow 


91 RR-R^ HMTMR 

C. I DO DO. nivl I IND 


1 140 


HMTNR 

£ I DD UtCniVI I IND 


1 141 


91RR-19 HMTMR 

c. \ DD 1 & .nivl 1 IND 


1 149 


91RR-49 HMTMR 
c. I DD-*ftC.nivl I IND 


1 149 
I 1 *fO 


91RR-4R HMTMR 
c. IDD-H-D.nivl I IND 


1 144 
I I **** 


91RR-47 HMTMR 
c. I DD-H- / .nivl 1 IND 


1 14^ 


91R7-H7 HMTMR 
c. ID/-U/ .rllvi I IND 


1 1 4ft 


91R7-1R HMTMR 
d. ID/- I D . n I V I I IN D 


1 1 47 


91R7-49 HMTMR 
c. I D/-*+*c.nivl I IND 


l I HO 


91R7-RR HMTMR 

£ ID/ DO.nivl I IND 


1 14Q 


91R7-RR HMTMR 
£ !Of oo.nivi 1 IND 


11Cf| 
I t Ow 


91R7-7Q HMTMR 

£ I D / / i7. n IVl I IND 


1 1^1 


91R7-QD HMTMR 

£ 1 D f C7VJ. nivl 1 IND 


1 1R9 
1 1 Ot. 


91R7-Q4 HMTMR 

£ 1 D/ C3*T.niVl 1 IND 


1 1 WW 


91 HMTMR 


1 1 R4 

1 1 D* 


91 RQ-19 HMTNR 

£ I U C7 It.niVI 1 IND 


11 R5 

1 1 oo 


91RQ-1R HMTNR 


1 1 Sfi 


91RQ-17 HMTNR 

£ 1 Di7 1 / .nivl 1 IND 


1 1 R7 


91RQ-1Q HMTNR 

£ 1 Ow" 1 57,niVI 1 IND 


1 158 

I I OQ 


P1RQ-PP HMTNR 

£ 1 Ow ££*il\Vl 1 IND 


1159 


2169-96 HMTNB 

C IWw £.0.1 IIVl 1 INLJ 


1160 


2169-33 HMTNB 

C— \\J%3 ww.l IIVl 1 liU 


1161 


2169-42 HMTNB 

C 1 Ww "t— . 1 IIVl 1 INUJ 


1162 


2169-46 HMTNB 


1163 


2169-47.HMTNB 


1164 


2169-57.HMTNB 


1165 


2169-69.HMTNB 


1166 


2171-06.HMTNB 


1167 


' 2171-09.HMTNB 


1168 


I 21 71 -11. HMTNB 



Table IV (cont'd) 



OuUIUvllWi; 1 


Name :&rw*w$&t$Ati&£$ 




P171-PQ HMTNR 


1 17f) 


P171 -33 HMTNR 
1 1 / i oo. mvi i inu 


1 171 
lift 


2171-35 HMTNR 

C I r 1 OJ.nlVI 1 IND 


1 179 
life. 


9171-41 HMTNR 
I / I *+ i .nivi 1 IND 


1 17*3 
I I f o 


91 71-54 HMTNR 
c.i ( \ -ot.nivi I IMD 


1 174 


0171.C7 HMTNR 
1 ( i 3 / .mvi 1 IND 


1 175 


P171-RQ HMTNR 


1 17R 

II fU 


P171-RP HMTNR 

t. i / i ot..nivi i ind 


1 177 


2171-84 HMTNR 

C l 1 1 *J*T.I IIVI 1 1 M LJ 


1 178 

ll ru 


2171-85 HMTNR 

Cm 1 f 1 UUd IIVI 1 liU 


1 17Q 


2173-12 HMTNR 


1 180 


2173-34 HMTNR 

C 1 i w u^.l IIVI I IND 


11R1 

1 1 O 1 


2173-42 HMTNR 

C 1 / J tt..l HVI 1 IND 


1 1ft? 

t 1 Oc. 


P173-4R HMTNR 

C l f O'HO.rHVI 1 IND 


1 1 83 


2173-54 HMTNR 

C 1 / J JH.l IIVI 1 IND 


1 184 

1 I 0*+ 


2173-57 HMTNR 
1. 1 r o o/ .nivt 1 IND 


1 1 R5 


9173-75 HMTNR 
c. f / o / \j , n ivi ■ ind 


1 1 ftfi 
I I oo 


P173-RR HMTNR 


1 1R7 


2173-91 HMTNR 

CXI w 17 1 .IIIVI 1 IND 


1 1RR 
I I oo 


9175-Ofi HMTNR 

Cm I I J uu.nivi 1 IND 


1 1ftQ 


9175-1 5 HMTNR 

t. 1 / \J 1 O.nlVI 1 IND 


1 ion 


P175-P0 HMTNR 

C l f Ofc.VJ.ll IVI I IND 


1 1 91 

1 I CJ 1 


P1 75-58 HMTNR 

CXI »J Ju.nivi 1 IND 


1 1 QP 


9175-9R HMTNR 

C. 1 / O"C70.niVI 1 IND 


1 1 Q3 


P1 77-1 R HMTNR 
c. \ / / I u.mvi I IND 


1 1 Q4 


9177-70 HMTNR 
C \ I f " f u.nivi 1 IND 


1 1 Q5 

1 1 9J 


9177-RR HMTNR 

/II// ou. nivi 1 IND 


1 1QR 


917Q-09 HMTNR 

C 1 / %7 ucnivi 1 IND 


1 1Q7 
i i J? / 


917Q-03 HMTNR 

C I / 3 UO.I IIVI 1 IND 


1 1 Qft 

1 1 tJO 


917Q-1Q HMTNR 

C \ 1 v? I sj.nivi 1 IND 


1 1 QQ 

1 1 57S7 


917Q-99 HMTNR 

C \ 1 C7 fc.t-.ri IVI 1 IND 


1P0O 


2179-29 HMTNR 

L 1 1 V fc.^.1 IIVI 1 INLJ 


1201 


2179-39 HMTNR 

Cl I & \JZJ .1 IIVI 1 INLJ 


1 POP 


9179-R3 HMTNR 

fc. 1 / \7 DO.nivl 1 IND 


1P03 


9181-04 HMTNR 

L IU 1 W*T. I 1 IVI 1 1 ND 




2181-24 HMTNR 

£.101 fc,*T.I IIVI 1 IND 




2181-35 HMTNR 

C 1 O 1 OO.I IIVI 1 IND 


1P0R 


91R1-RR HMTNR 

c 10 1 uu.nivi 1 IND 


1P07 


91R1-75 HMTNR 

£.IO 1 /O.niVI 1 IND 


1P0P 


P1R1-76 HMTNR 

C 1 O 1 1 U.I IIVI 1 liU 


120C 


2181-84 HMTNB 


ipir 


1 2183-05 HMTNB 

£. IUO UJ>I IIVI 1 1 » 1— ' 


1 £l , 


01 1Q UMTMR 
cl 00- 1 o.nlvl 1 Ind 


1212 


! 2183-17.HMTNB 


1212 


! 2183-28.HMTNB 


121 ^ 


I 21 83-45. HMTNB 


121£ 


5 2183-50.HMTNB 


1216 


5 2183-51 .HMTNB 



OCvJ 1U InU. 1 




1217 

It. 1 / 


2183-70 HMTNB 


121R 

1 L, 1 U 


2185-05 HMTNR 

t_ IUJ UJtl IIVI 1 INLJ 


121Q 


2185-10 HMTNB 

Cm 1 UvJ 1 W>l IIVI 1 IND 


12P0 


21 R5-1 P HMTNR 

£L I OJ I £.11 IVI 1 IND 


1P91 

I CC I 


91R5-1R HMTNR 
c. 1 00 i o.nivi 1 ind 


1 P99 


9185-43 HMTNR 
c. \ 00 *+o.nivi 1 IND 


1PP3 

1 fc.fc.O 


9185-4Q HMTNR 
c. i 00 HcJ.nivi 1 Ind 


1P94 


91P.5-54 HMTNR 
c \ ou OH.nivi 1 ind 


1P95 


91R5-RP HMTNR 

£ I OiJ'OC.illVI I InD 


1PPR 

1 £.£0 


P1 R7-91 HMTNR 
c. I 0 / c. 1 . n ivi 1 Ind 


1997 


91R7-37 HMTNR 
c 10/ O f . n IVI 1 Ind 


1P9R 


P 187-47 HMTNR 

£ IO/ *T f .nivi 1 IND 


l CCiD 


P1R7-Q3 HMTNR 
c. 10/ oo.nivi 1 IND 


1930 


91R8-9P HMTNR 
c. I oo'^^.nivi 1 ind 


19*31 

I C.O 1 


91RR-9Q HMTNR 

C 1 OO £-C7.r IIVI 1 IND 


19^9 
1 C-Oc. 


918R-3P HMTNR 

C IOO Ot.niVI t IND 


I fc.OO 


91RR-59 HMTNR 
c 1 00 u<-.nivi I IND 


1 9^4 


91RR-54 HMTNR 
£. 1 00 oo.nivi 1 ind 


1 935 

1 tOO 


91RR-7P HMTNR 

C- \ OO / c .n IVI 1 1 N D 


1 93R 


P188-Q? HMTNR 

C- 1 OO C7t..rMVI 1 IND 


1 937 


91RQ-31 HMTNR 
c. 1 oc? 0 i .nivi 1 ind 


1 93R 
1 c.00 


918Q-5R HMTNR 

C IOJ7 oo.nivi 1 IND 


193Q 


P18Q-75 HMTNR 

C. IOO 1 vJ.I IIVI 1 IND 


1P40 


P18Q-84 HMTNR 

C. I OS? Of. nivi 1 IND 


1941 


91Q1-93 HMTNR 
c. I 9 I'^o.nivi 1 ind 


1 949 


91 Q1 -38 HMTNR 
c. 1 57 1 "OO.nivi 1 ind 


1P43 


P1Q1-58 HMTNR 

C 1 v7 1 OCJ.I IIVI I IND 


1P44 


P1Q1-73 HMTNR 

\ f O.I IIVI 1 IND 


1P45 


91Q1-77 HMTNR 

Cm 1 Z7 1 / / .1 IIVI 1 IND 


194R 


P1Q1-Q0 HMTNR 

Cm 1 17 1 ZJ\J . 1 I IVI 1 1 ND 


1947 


P1Q1-Q4 HMTNR 

Cm 1 C7 1 v7*t*I IIVI 1 IND 


1248 


P1Q1-QR HMTNR 

C I C7 1 ^ w ■ I IIVI 1 IND 


124Q 


2192-03 HMTNR 

Cm I0t V_/VJ . I IIVI 1 1 X 1— ' 


1 P50 


21Q2-14 HMTNR 

IC7£. 1 'T.l IIVI 1 IND 


1251 


21Q2-36 HMTNR 

Cm 1 C7£. vJU. » IIVI 1 IND 


1252 


21Q2-46 HMTNR 

C 1 C7fc> *^U*I IIVI 1 IND 


1P53 


21QP-RR HMTNR 

Cm l\3Cm OO.niVI 1 IND 


1 P54 


P1Q4-07 HMTNR 

C. I v7H"V// .IIIVI 1 IND 


1 P55 


P1Q4-13 HMTNR 

C 1 C7*T 1 o.nivi 1 IND 


1P5P 


2194-16 HMTNB 

C 1 C7*T lUil IIVI 1 IND 


1P57 


' 2194-18 HMTNB 

C IC7*T 1 IJ. 1 IIVI 1 IND 


125F 


1 2194-28 HMTNB 

1 Cm 1 C-\Jrl IIVI 1 1 lU 


1 O^C 


' c. \ yO"UD.niVI 1 Ind 


126C 


) 2195-47.HMTNB 


1261 


21 95-60. HMTNB 


1265 


2196-18.HMTNB 


126C 


! 2196-30.HMTNB 


126^ 


\ 2196-53.HMTNB 



61 



Table IV (cont'd) 



SEQID:;NO,:> 






^EQliPiNO:: 


^ arne. ?! : uis5'. ••• : Q- : 


12b5 


2 1 yb-bo.HM 1 Nb 




1313 


nnOH A A , t LIU JlTMD 

2231-44U.HM I Nb 


1266 


2 1yb-/b.HM 1 Nb 




H O H A 

1314 


OOOH C*n. i LIU JI*T*KI D 

2231 -50U.HMTNB 


12b/ 


1 9/-2o.HM 1 Nb 




1315 


OOOH CHu LI K A~1~K \ D 

2231 -51 U.HMTNB 


12bo 


2iy/-4b.HM 1 Nb 




lolb 


2231 -63U.HMTNB 


1269 


n -4 n~7 ch i i n /itk in 

21 97-51 .HM7 NB 




H O H "7 

1317 


onnn /■> o . . i_jhji*t*kid 

2231-68U.HMTNB 


1270 


OHQ7 CO LJU/ITMD 

21 97-59. HM i Nb 




1318 


OOO H ~7 A . . [1 Ik ITK i n 

2231-74U.HMTNB 


Id/ 1 


2202-9b.HM 1 Nb 




H oh n 

1319 


OOOH OOii L!K JITK ID 

2231 -82U.HMTNB 


1272 


OOAO OC LIK/ITMD 

2203-36. hi Ml Nb 




a onn 

1320 


nnon oc >• LJik*Tnn 

2231-85U.HMTNB 


H 0*70 

1273 


ooA/i no i \ \ <TKin 
2204-09. HM I Nb 




j on h 

1321 


2231-88U.HMTNB 


H 0*7 /4 

1274 


oooc h h ukjitmd 
2205-11. HMI Nb 




h onn 

1322 


OOO H Oil.. U \ A~T"h 1 D 

2231-94U.HMTNB 


1275 


none on i m irun 

2205-33. HM I Nb 




h ono 

1323 


OOO H n C . . LIRJITMD 

2231 -95U.HMTNB 


1276 


2205-43. HM 1 NB 




«i on it 

1324 


nnnn n o ■ ■ Lin jiTMn 

2232-03U.HMTNB 


1277 


none or UUTMO 

2205-85. HMT NB 




none 

1325 


nnoo -4 4.1 l J !k a t m r> 

2232-1 1U.HMTNB 


1278 


onnn no.. JITPKIQ 

2229-08U.HMTNB 




1326 


nnnn H n. . i ik i tt k l n 

2232-1 9U.HMTNB 


1279 


nnon -* o . . i t h a~t~k l n 

2229-1 Ou.HMTNB 




1327 


o o o o nr.. i ihji*t*kid 

2232-25U.HMTNB 


1280 


onnn jo., im jitpk id 

2229-1 2U.HMTNB 




1328 


2232-30u.HMTNB 


1281 


nnon a a., t i R jitk i n 

2229-1 4U.HMTNB 




■i nnn 

1329 


O O O O AA.. 1 IK *TMH 

2232-44u.HMTNB 


^ ono 
1282 


nnon n*7. . t I n jitk t D 

2229-27U.HMTNB 




a nnn 

1330 


nnnn rrn. ■ t I & mt k t o 

2232-50U.HMTNB 


1283 


onon /in,, lhjitmd 

2229-40U.HMTNB 




4 on H 

1331 


O O O O cn.. LlfcJITMD 

2232-56U.HMTNB 


«* no a 

1284 


2229-45U.HMTNB 




H OOO 

1332 


onon on t ■ i_ifcj|-rMD 

2232-60U.HMTNB 


J oo c 

1285 


onon /ton i_j k itk in 

2229-48U.HMTNB 




H OOO 

1333 


onon CAtt i i* j~tk id 

2232-64U.HMTNB 


4 o o o 

1286 


nnon rn.i LJRiiTMD 

2229-50U.HM 1 Nb 




1334 


nnon th ■ . i i > <Tfc 1 d 

2232-71 U.HM I Nb 


1287 


nnon C/i.. i i fc jitk id 

2229-54U.HMTNB 




a one 

1335 


OOOO 70i . 1 1 1 ITK 1 D 

2232-73U.HMTNB 


4000 

1288 


onnn coit ukatmd 

2229-56U.HM 1 Nb 




1336 


OOOO OOt ■ 1 1 k ITh 1 D 

2232-80U.HM 1 Nb 


noon 

1289 


OOOn C"7i ■ 1 Ifc ITM D 

2229-57U.HM 1 Nb 




H 007 

1337 


OOOO OOi i 1 1 H HTK ID 

2232-83U.HM INB 


•* onn 

1290 


noon r~ n ■ ■ i in <~r~ k i n 

2229-59U.HMTNB 




H OOO 

1338 


nnoo no. ■ i in k i d 

2233-02U.HMTNB 


1291 


onnn 70, , 1 n <*tr i d 

2229-70U.HMTNB 




h oon 

1339 


OOOO CO.. t J K yi "T" MD 

2233-53U.HMTNB 


1292 


nnon 0*7.. ljkjitkid 

2229-87U.HMTNB 




h o ji n 

1340 


nnoo n,. umiTMn 

2233-57U.HMTNB 


a onn 

1293 


nnnn n -4 . . uii ATMn 

2229-91 U.HMTNB 




H O /I H 

1341 


oooo co.. umiTMn 

2233-58U.HMTNB 


•4 on A 

1294 


nnnn nc . 1 1 k atk ! n 

2229-95U.HMTNB 




1342 


nnoo on.. LJUJITKID 

2233-80U.HMTNB 


■4 one 

1295 


2230-07U.HM 1 Nb 




H O. yl O 

1343 


OOOO QHii UI\JITMD 

2233-81 U.HM 1 Nb 


4 ooe 
1296 


OOOA <1 in UyTMD 

2230-1 1 U.HM I Nb 




H Ovl /I 

1344 


OOOO OOn UMTMD 

2233-83U.HMTNb 


129/ 


2230-1 9U.HM 1 Nb 




1o4o 


2^o4-0^U.HM 1 Nb 


1298 


2230-2/U.HM 1 Nb 




io4b 


22o4-0oU.HM 1 Nb 


1299 


2230-ooU.HM 1 Nb 




1 34/ 


22o4-0bU.HM 1 Nb 


o nn 

1300 


onon A H . i UfcATKIQ 

2230-41 U.HM I Nb 




1 348 


OOO A no • i LJ1VATMQ 

2234-06U.HM FNb 


h on* 

130 


OOOA CH i i LJRjITMD 

2230-51 U.HM i Nb 




4 n AC\ 

1 349 


OOO/I OO,, i in ITfc ID 

2234-09U.HM INB 


1302 


onon COi ■ LJfcJITRIQ 

2230-56U.HMTNB 




1350 


2234-1 2U.HMTNB 


1303 


ooon ecu i m hth id 
2230-66U.HM I Nb 




1 351 


OOO/I OOn LJKV1TKID 

2234-23U.HM 1 Nb 


1304 


onon 74n i_i K /iTMn 

2230-71 U.HMTNB 




ocn 

1352 


OOO/I OCii UKJITMD 

2234-20U.HM 1 Nb 


•4 onc 

130c 


2230-75. HM 1 Nb 




lo5o 


22o4-4bU.nM 1 Nb 


130c 


OOOf\ Din LJIUITMD 

2230-81 U.HM 1 Nb 






OO0>1 CCm umtmq 

2234-ddU.HM I Nb 


1 Ou f 


U7U.I JiVI 1 MU 




13^ 


P {.LOT U( U.I 1 J V 1 1 1 X l-J 


130£ 


1 2230-93U.HMTNB 




i35e 


. 2234-70U.HMTNB 


130S 


1 2231-23U.HMTNB 




1357 


' 2234-74U.HMTNB 


131C 


) 2231-26U.HMTNB 




135E 


( 2234-77U.HMTNB 


1311 


2231-32U.HMTNB 




135S 


) 2234-82U.HMTNB 


1315 


> 2231-37U.HMTNB 




136C 


) 2234-88U.HMTNB 
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Table IV (cont'd) 



Q 

u 
W 

m 

2 

m 
a 



SEQM)^ 


.Sm^illillliiillsl 




SEQIQiNO: 




1361 


2234-89U.HMTNB 




1409 


2244-47 


1362 


2234-90U.HMTNB 




1410 


2244-52 


1363 


2234-93U.HMTNB 




1411 


2244-57 


1364 


2240-39 




1412 


2244-63 


1365 


2240-40 




1413 


2244-68 


1366 


2240-49 




1414 


2244-77 


. 1367 


2240-51 




1415 


2244-80 


1368 


2240-57 




1369 


2240-61 




1370 


2240-62 




1371 


2241-2 




1372 


2241-3 




1373 


2241-8 




1374 


2241-9 




1375 


2241-13 




1376 


2241-21 




1377 


2241-29 




1378 


2241-38 




1379 


2241-45 




1380 


2241-49 




1381 


2241-51 




1382 


2241-57 




1383 


2241-63 




1384 


2241-68 




1385 


2241-89 




1386 


2241-91 




1387 


2243-2 




1388 


2243-3 




1389 


2243-12 




1390 


2243-14 




1391 


2243-19 




1392 


2243-24 




1393 


2243-25 




1394 


2243-33 




1395 


2243-49 




1396 


2243-50 




1397 


2243-51 




139fi 


I 2243-59 




139S 


I 2243-63 




140C 


) 2243-69 




1401 


2243-74 




140£ 


! 2243-75 




1405 


2243-77 




140' 


I 2244-19 




140£ 


5 2244-26 




140e 


3 2244-35 




1401 


' 2244-38 




140J 


3 2244-40 
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In one embodiment, a gene or other nucleic acid molecule of the present invention 
can be an allelic variant that includes a similar but not identical sequence to SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ED NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID 
5 NO: 18, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID 
NO:25, SEQ IDNO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO:153, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:158, SEQ ID 
10 NO: 159, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 164, SEQ ID NO:165, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 170, SEQ ID NO: 1859, SEQ ID NO: 1860, SEQ 
ID N0.1861, SEQ ID NO:1863, SEQ ID NO:1864, SEQ ID NO:1866, SEQ ID NO:1867, 
SEQ ID NO: 1869, SEQ ID NO: 1870, SEQ ID NO: 1871, SEQ ID NO: 1872, SEQ ID 
NO: 1874, SEQ ID NO: 1875, SEQ ID NO: 1876, SEQ ID NO: 1877, SEQ ID NO: 1878, 
1 5 SEQ ID NO: 1 880, SEQ ID NO: 1 88 1 , SEQ ID NO: 1 8 82, SEQ ID NO : 1 884, SEQ ID 
NO: 1885, SEQ ID NO: 1886, SEQ ID NO: 1887, SEQ ID NO: 1889, SEQ ID NO: 1890, 
SEQ ID NO:1891, SEQ ID NO:1892, SEQ ID NO:1893, SEQ ID NO:1894, SEQ ID 
NO: 1895, SEQ ID NO: 1896, SEQ ID NO: 1898, SEQ ID NO: 1899, SEQ ID NO: 1900, 
SEQ ID NO: 1901, SEQ ID NO: 1903, SEQ ID NO: 1904, SEQ ID NO: 1905, SEQ ID 
20 NO: 1906, SEQ ID NO: 1907, SEQ ID NO:1908, SEQ ID NO: 1909, SEQ ID NO:1910, 
SEQ ID NO:191 1, SEQ ID NO:1912, SEQ ID NO:1913, SEQ ID NO:1914, SEQ ID 
NO:1916, SEQIDNO:1917, SEQIDNO:1918, SEQ ID NO:1919, SEQn)NO:1921, 
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SEQ ID NO: 1922, SEQ ID NO: 1923, SEQ ID NO: 1924, SEQ ID NO: 1926, SEQ ID 
NO:1927, SEQ ID NO:1928, SEQ ID NO:1929, and/or SEQ ID NO:1931 or zCfelis 
nucleic acid sequence of Table I, Table II, Table EI and/or Table IV or a complement 
thereof. For example, an allelic variant of a C.felis ALN gene including SEQ ID NO:l is 
a gene that occurs at essentially the same locus (or loci) in the genome as the gene 
including SEQ ID NO:l, but which, due to natural variations caused by, for example, 
mutation or recombination, has a similar but not identical sequence. Because natural 
selection typically selects against alterations that affect function, allelic variants (i.e. 
alleles corresponding to, or of, cited nucleic acid sequences) usually encode proteins 
having similar activity to that of the protein encoded by the gene to which they are being 
compared. Allelic variants of genes or nucleic acid molecules can also comprise 
alterations in the 5' or 3 1 untranslated regions of the gene (e.g., in regulatory control 
regions), or can involve alternative splicing of a nascent transcript, thereby bringing 
alternative exons into juxtaposition. Allelic variants are well known to those skilled in 
the art and would be expected to occur naturally within a given flea such as C. felis, since 
the genome is diploid, and sexual reproduction will result in the reassortment of alleles. 
For example, SEQ ID NO: 162 is apparently an allelic variant or multiple gene of SEQ ID 
NO:153. 

In one embodiment of the present invention, isolated HMT and HNC proteins are 
encoded by nucleic acid molecules that hybridize under stringent hybridization conditions 
to genes or other nucleic acid molecules encoding flea HMT and HNC proteins, 
respectively. The minimal size of HMT and HNC proteins of the present invention is a 
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size sufficient to be encoded by a nucleic acid molecule capable of forming a stable 
hybrid (i.e., hybridizing under stringent hybridization conditions) with the complementary 
sequence of a nucleic acid molecule encoding the corresponding natural protein. The size 
of a nucleic acid molecule encoding such a protein is dependent on the nucleic acid 
composition and the percent homology between the flea HMT or HNC nucleic acid 
molecule and the complementary nucleic acid sequence. It can easily be understood that 
the extent of homology required to form a stable hybrid under stringent conditions can 
vary depending on whether the homologous sequences are interspersed throughout a 
given nucleic acid molecule or are clustered (i.e., localized) in distinct regions on a given 
nucleic acid molecule. 

The minimal size of a nucleic acid molecule capable of forming a stable hybrid 
with a gene encoding a flea HMT or HNC protein is typically at least about 12 to about 
15 nucleotides in length if the nucleic acid molecule is GG-rich and at least about 15 to 
about 17 bases in length if it is AT-rich. The minimal size of a nucleic acid molecule 
used to encode an HMT or HNC protein homologue of the present invention is from 
about 12 to about 18 nucleotides in length. Thus, the minimal size of HMT or HNC 
protein homologues of the present invention is from about 4 to about 6 amino acids in 
length. There is no limit, other than a practical limit, on the maximal size of a nucleic 
acid molecule encoding a flea HMT or HNC protein of the present invention because a 
nucleic acid molecule of the present invention can include a portion of a gene, an entire 
gene, or multiple genes. The preferred size of a protein encoded by a nucleic acid 
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molecule of the present invention depends on whether a full-length, fusion, multivalent, 
or functional portion of such a protein is desired. 

Stringent hybridization conditions are determined based on defined physical 
properties of the gene to which the nucleic acid molecule is being hybridized, and can be 
defined mathematically. Stringent hybridization conditions are those experimental 
parameters that allow an individual skilled in the art to identify significant similarities 
between heterologous nucleic acid molecules. These conditions are well known to those 
skilled in the art. See, for example, Sambrook, et ai, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Labs Press, and Meinkoth, et aL, 1984, Anal 
Biochem. 735, 267-284, each of which is incorporated by reference herein in its entirety. 
As explained in detail in the cited references, the determination of hybridization 
conditions involves the manipulation of a set of variables including the ionic strength (M, 
in moles/liter), the hybridization temperature (°C), the concentration of nucleic acid helix 
destabilizing agents (such as formamide), the average length of the shortest hybrid duplex 
(n), and the percent G + C composition of the fragment to which an unknown nucleic acid 
molecule is being hybridized. For nucleic acid molecules of at least about 150 
nucleotides, these variables are inserted into a standard mathematical formula to calculate 
the melting temperature, or T m , of a given nucleic acid molecule. As defined in the 
formula below, T m is the temperature at which two complementary nucleic acid molecule 
strands will disassociate, assuming 100% complementarity between the two strands: 
T m = 81.5°C + 16.6 log M + 0.41(%G + C) - 500/n - 0.61(%formamide). 
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For nucleic acid molecules smaller than about 50 nucleotides, hybrid stability is defined 
by the dissociation temperature (T^, which is defined as the temperature at which 50% of 
the duplexes dissociate. For these smaller molecules, the stability at a standard ionic 
strength is defined by the following equation: 

T d = 4(G + C) + 2(A + T). 
A temperature of 5°C below T d is used to detect hybridization between perfectly matched 
molecules. 

Also well known to those skilled in the art is how base pair mismatch, i.e. 
differences between two nucleic acid molecules being compared, including non- 
complementarity of bases at a given location, and gaps due to insertion or deletion of one 
or more bases at a given location on either of the nucleic acid molecules being compared, 
will affect T m or T d for nucleic acid molecules of different sizes. For example, T m 
decreases about 1°C for each 1% of mismatched base pairs for hybrids greater than about 
150 bp, and T d decreases about 5°C for each mismatched base pair for hybrids below 
about 50 bp. Conditions for hybrids between about 50 and about 150 base pairs can be 
determined empirically and without undue experimentation using standard laboratory 
procedures well known to those skilled in the art. These simple procedures allow one 
skilled in the art to set the hybridization conditions (by altering, for example, the salt 
concentration, the formamide concentration or the temperature) so that only nucleic acid 
hybrids with greater than a specified % base pair mismatch will hybridize. Stringent 
hybridization conditions are commonly understood by those skilled in the art to be those 
experimental conditions that will allow less than or equal to about 30% base pair 
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mismatch (i.e., at least about 70% identity). Because one skilled in the art can easily 
determine whether a given nucleic acid molecule to be tested is less than or greater than 
about 50 nucleotides, and can therefore choose the appropriate formula for determining 
hybridization conditions, he or she can determine whether the nucleic acid molecule will 
hybridize with a given gene under stringent hybridization conditions and similarly 
whether the nucleic acid molecule will hybridize under conditions designed to allow a 
desired amount of base pair mismatch. 

Hybridization reactions are often carried out by attaching the nucleic acid 
molecule to be hybridized to a solid support such as a membrane, and then hybridizing 
with a labeled nucleic acid molecule, typically referred to as a probe, suspended in a 
hybridization solution. Examples of common hybridization reaction techniques include, 
but are not limited to, the well-known Southern and northern blotting procedures. 
Typically, the actual hybridization reaction is done under non-stringent conditions, i.e., at 
a lower temperature and/or a higher salt concentration, and then high stringency is 
achieved by washing the membrane in a solution with a higher temperature and/or lower 
salt concentration in order to achieve the desired stringency. 

For example, if the skilled artisan wished to identify a nucleic acid molecule that 
hybridizes under conditions that would allow less than or equal to 30% pair mismatch 
with a flea nucleic acid molecule of about 150 bp in length or greater, the following 
conditions could preferably be used. The average G + C content of flea DNA is about 
37%, as calculated from known flea nucleic acid sequences. The unknown nucleic acid 
molecules would be attached to a support membrane, and the 150 bp probe would be 
labeled, e.g. with a radioactive tag. The hybridization reaction could be carried out in a 
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solution comprising 2X SSC and 0% formamide, at a temperature of about 37°C (low 
stringency conditions). Solutions of differing concentrations of SSC can be made by one 
of skill in the art by diluting a stock solution of 20X SSC (175.3 gram NaCl and about 
88.2 gram sodium citrate in 1 liter of water, pH 7) to obtain the desired concentration of 
SSC. The skilled artisan would calculate the washing conditions required to allow up to 
30% base pair mismatch. For example, in a wash solution comprising IX SSC and 0% 
fonnamide, the T m of perfect hybrids would be about 77°C: 

81.5°C + 16.6 log (ASM) + (0.41 x 0.37) - (500/150) - (0.61 x 0) = 77.5°C. 
Thus, to achieve hybridization with nucleic acid molecules having about 30% base pair 
mismatch, hybridization washes would be carried out at a temperature of less than or 
equal to 47.5°C. It is thus within the skill of one in the art to calculate additional 
hybridization temperatures based on the desired percentage base pair mismatch, formulae 
and G/C content disclosed herein. For example, it is appreciated by one skilled in the art 
that as the nucleic acid molecule to be tested for hybridization against nucleic acid 
molecules of the present invention having sequences specified herein becomes longer 
than 150 nucleotides, the T m for a hybridization reaction allowing up to 30% base pair 
mismatch will not vary significantly from 47.5 °C. 

Furthermore, it is known in the art that there are commercially available computer 
programs for determining the degree of similarity between two nucleic acid sequences. 
These computer programs include various known methods to determine the percentage 
identity and the number and length of gaps between hybrid nucleic acid molecules. 
Preferred methods to determine the percent identity among amino acid sequences and 
also among nucleic acid sequences include analysis using one or more of the 
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commercially available computer programs designed to compare and analyze nucleic acid 
or amino acid sequences. These computer programs include, but are not limited to, the 
Wisconsin Package Version 9.0 sequence analysis software, available from Genetics 
Computer Group (GCG™), Madison, WI, DNAsis™, available from Hitachi Software, 
San Bruno, CA, and Mac Vector™, available from the Eastman Kodak Company, New 
Haven, CT. A preferred method to determine percent identity among amino acid 
sequences and also among nucleic acid sequences includes using the GAP program with 
pair-wise comparisons within the GCG™ Wisconsin Package Version 9.0 sequence 

analysis software, hereinafter referred to as default parameters. 

One embodiment of the present invention includes flea ALN, CBP, NKAB, 
LGIC, ANON, MALV, OS-D, NMDA, CLBP, NAH, CLIC, PL2, PL3, PL4, SVP, 
VGCC, AUP, and 7B2 proteins. A preferred flea ALN protein includes a protein encoded 
by a nucleic acid molecule that hybridizes under conditions that preferably allow less than 
or equal to about 30% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 25% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 20% base pair mismatch, more preferably under 
conditions that allow less than or equal to about 15% base pair mismatch, more preferably 
under conditions that allow less than or equal to about 10% base pair mismatch and even 
more preferably under conditions that allow less than or equal to about 5% base pair 
mismatch with a nucleic acid molecule selected from the group consisting of SEQ ID 
NO:3 and SEQE>NO:6. 
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A preferred flea CBP protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:9 and SEQ ID 
NO:12. 

A preferred flea NKAB protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 15 and SEQ ID 
NO:18. 

A preferred flea LGIC protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
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30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:21 and SEQ ID 
NO:24, SEQ ID NO: I860, SEQ ID NO: 1863, and SEQ ID NO: 1866. 

A preferred flea ANON protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:27 and SEQ ID 
NO:30. 

A preferred flea MALV protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
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or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
5 nucleic acid molecule selected from the group consisting of SEQ ED NO:33 and SEQ ID 
NO:36. 

A preferred flea OS-D protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 

10 to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 1 5% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 

15 nucleic acid molecule selected from the group consisting of SEQ ID NO:39 and SEQ ID 
NO:42. 

A preferred flea NMDA protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
20 to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
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that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:45 and SEQ ID 
NO:48. 

A preferred flea CLBP protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 155, SEQ ID 
NO: 158, SEQ ID NO:161, SEQ ID NO:164, SEQ ID NO:167 and SEQ ID NO: 170. 

A preferred flea NAH protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
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nucleic acid molecule selected from the group consisting of SEQ ID NO: 1869 and SEQ 
IDNO:1871. 

A preferred flea CLIC protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 

5 30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 

10 under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1874 and SEQ 
IDNO:1876. 

A preferred flea PL2 protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 

15 30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 

20 under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1879, SEQ ID 
NO: 1881, SEQ ID NO:1884, and SEQ ID NO: 1886. 
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A preferred flea CPL3 protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 1 5% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1889 and SEQ 
IDNO:189L 

A preferred flea PL4 protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1893, SEQ ID 
NO: 1895, SEQ ID NO:1898, and SEQ ID NO:1900. 

A preferred flea S VP protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
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30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1903 and SEQ 
IDNO:1905. 

A preferred flea VGCC protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1907, SEQ ID 
NO:1909, SEQ ID NO:1911, SEQ ID NO:1913, SEQ ID NO:1916, and SEQ ID 
NO:1918. 

A preferred flea AUP protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
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to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1921 and SEQ 
IDNO:1923. 

A preferred flea 7B2 protein includes a protein encoded by a nucleic acid 
molecule that hybridizes under conditions that preferably allow less than or equal to about 
30% base pair mismatch, more preferably under conditions that allow less than or equal 
to about 25% base pair mismatch, more preferably under conditions that allow less than 
or equal to about 20% base pair mismatch, more preferably under conditions that allow 
less than or equal to about 15% base pair mismatch, more preferably under conditions 
that allow less than or equal to about 10% base pair mismatch and even more preferably 
under conditions that allow less than or equal to about 5% base pair mismatch with a 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1926, SEQ ID 
NO: 1928, and SEQ ID NO: 1931. 

A preferred flea HMT and/or HNC protein includes a protein encoded by a 
nucleic acid molecule that hybridizes under conditions that preferably allow less than or 
equal to about 30% base pair mismatch, more preferably under conditions that allow less 
than or equal to about 25% base pair mismatch, more preferably under conditions that 
allow less than or equal to about 20% base pair mismatch, more preferably under 
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conditions that allow less than or equal to about 15% base pair mismatch, more preferably 
under conditions that allow less than or equal to about 10% base pair mismatch and even 
more preferably under conditions that allow less than or equal to about 5% base pair 
mismatch with a nucleic acid molecule selected from the group consisting of a nucleic 
5 acid sequence complementary to a nucleic acid sequence of Table I, Table II, Table m 
and/or Table IV. 

Another embodiment of the present invention includes a flea ALN protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 

10 about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:3 and SEQ ID NO:6. 

Another embodiment of the present invention includes a flea CBP protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 

15 hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:9 and SEQ ID NO:13. 

Another embodiment of the present invention includes a flea NKAB protein 

20 encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
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temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:15 and SEQ ID NO: 18. 

Another embodiment of the present invention includes a flea LGIC protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
5 hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO;21, SEQ ID NO:24, SEQ ID NO: 1860, SEQ ID NO: 1863, and 
SEQ ID NO: 1866. 

10 Another embodiment of the present invention includes a flea ANON protein 

encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 

15 consisting of SEQ ID NO:27 and SEQ ID NO:30. 

Another embodiment of the present invention includes a flea MALV protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 

20 temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:33 and SEQ ID NO:36. 
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Another embodiment of the present invention includes a flea OS-D protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:39 and SEQ ID NO:42. 

Another embodiment of the present invention includes a flea NMDA protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:45 and SEQ ID NO:48. 

Another embodiment of the present invention includes a flea CLBP protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO: 155, SEQ ID NO:158, SEQ ID NO:161, SEQ ID NO:164, SEQ 
ID NO: 167 and SEQ ID NO: 170. 

Another embodiment of the present invention includes a flea NAH protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
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about 37 C C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°Q to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO: 1869 and SEQ ID NO: 1871. 

Another embodiment of the present invention includes a flea CLIC protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:1874 and SEQ ID NO:1876. 

Another embodiment of the present invention includes a flea PL2 protein encoded 
by a nucleic acid molecule that hybridizes under conditions comprising, (a) hybridizing in 
a solution comprising IX SSC and 0% formamide, at a temperature of about 37°C and (b) 
washing in a solution comprising IX SSC and 0% formamide, at a temperature of about 
47.5°C, to an isolated nucleic acid molecule selected from the group consisting of SEQ ID 
NO:1879, SEQ ED NO:1881, SEQ ID NO:1884 and SEQ ID NO:1886. 

Another embodiment of the present invention includes a flea PL3 protein encoded 
by a nucleic acid molecule that hybridizes under conditions comprising, (a) hybridizing in 
a solution comprising IX SSC and 0% formamide, at a temperature of about 37°C and (b) 
washing in a solution comprising IX SSC and 0% formamide, at a temperature of about 
47.5°C, to an isolated nucleic acid molecule selected from the group consisting of SEQ ID 
NO: 1889 and SEQ ID NO: 1891. 
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Another embodiment of the present invention includes a flea PL4 protein encoded 
by a nucleic acid molecule that hybridizes under conditions comprising, (a) hybridizing in 
a solution comprising IX SSC and 0% formamide, at a temperature of about 37°C and (b) 
washing in a solution comprising IX SSC and 0% formamide, at a temperature of about 
47.5°C, to an isolated nucleic acid molecule selected from the group consisting of SEQ ID 
NO: 1893, SEQ ID NO: 1895, SEQ ID NO: 1898, and SEQ ID NO:1900. 

Another embodiment of the present invention includes a flea SVP protein encoded 
by a nucleic acid molecule that hybridizes under conditions comprising, (a) hybridizing in 
a solution comprising IX SSC and 0% formamide, at a temperature of about 37°C and (b) 
washing in a solution comprising IX SSC and 0% formamide, at a temperature of about 
47.5°C, to an isolated nucleic acid molecule selected from the group consisting of SEQ ID 
NO: 1903 and SEQ ID NO: 1905. 

Another embodiment of the present invention includes a flea VGCC protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO:1907, SEQ ID NO: 1909, SEQ ID NO:1911, SEQ ID NO:1913, 
SEQ ID NO: 1916, and SEQ ID NO: 19 18. 

Another embodiment of the present invention includes a flea AUP protein 
encoded by a nucleic acid molecule that hybridizes under conditions comprising, (a) 
hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
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about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of SEQ ID NO: 1921 and SEQ ID NO: 1923. 

Another embodiment of the present invention includes a flea 7B2 protein encoded 
by a nucleic acid molecule that hybridizes under conditions comprising, (a) hybridizing in 
a solution comprising IX SSC and 0% formamide, at a temperature of about 37°C and (b) 
washing in a solution comprising IX SSC and 0% formamide, at a temperature of about 
47.5°C, to an isolated nucleic acid molecule selected from the group consisting of SEQ ID 
NO: 1926, SEQ ID NO: 1928, and SEQ ID NO: 193 1 . 

Another embodiment of the present invention includes a flea HMT and/or HNC 
protein encoded by a nucleic acid molecule that hybridizes under conditions comprising, 
(a) hybridizing in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 37°C and (b) washing in a solution comprising IX SSC and 0% formamide, at a 
temperature of about 47.5°C, to an isolated nucleic acid molecule selected from the group 
consisting of a nucleic acid sequence complementary to a nucleic acid sequence of Table 
I, Table H, Table EI and/or Table IV. 

Another preferred flea ALN protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably about at least 85% identical, more preferably about at least 
90% identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 1 and/or SEQ ID NO:4; also 
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preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 

Another preferred flea CBP protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 
preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably at least about 85% identical, more preferably at least about 90% 
identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:7 and/or SEQ ID NO: 10; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2. 1 
using default parameters. 

Another preferred flea NKAB protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 13 and/or SEQ ID NO: 16; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
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the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 

Another preferred flea LGIC protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 19, SEQ ID NO:22, SEQ ID 
NO: 1861, and/or SEQ ID NO: 1864; also preferred are fragments (i.e. portions) of such 
proteins encoded by nucleic acid molecules that are at least about 18 nucleotides. Percent 
identity as used herein is determined using the Compare function by maximum matching 
within the program DNAsis Version 2.1 using default parameters. 

Another preferred flea ANON protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:25 and/or SEQ ID NO:28; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 
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Another preferred flea MALV protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:31 and/or SEQ ID NO:34; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 

Another preferred flea OS-D protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:37 and/or SEQ ID NO:40; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 1 8 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 

Another preferred flea NMDA protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
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more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:43 and/or SEQ JD NO:46; also 
preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules 
that are at least about 18 nucleotides. Percent identity as used herein is determined using 
the Compare function by maximum matching within the program DNAsis Version 2.1 
using default parameters. 

Another preferred flea CLBP protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably at least about 85% identical, more preferably at least about 
90% identical, and even more preferably at least about 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:153, SEQ ID NO:156, SEQ ID 
NO:159, SEQ ID NO:162, SEQ ID NO:165 and/or SEQ ID NO:168; also preferred are 
fragments (i.e. portions) of such proteins encoded by nucleic acid molecules that are at 
least about 18 nucleotides. Percent identity as used herein is determined using the 
Compare function by maximum matching within the program DNAsis Version 2.1 using 
default parameters. 

Another preferred flea NAH protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
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identical, more preferably about at least 85% identical, more preferably about at least 
90% identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 1867 and/or SEQ ID NO: 1870; 
also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 
molecules that are at least about 18 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 
DNAsis Version 2.1 using default parameters. 

Another preferred flea CLIC protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably about at least 85% identical, more preferably about at least 
90% identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 1872 and/or SEQ ID NO: 1875; 
also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 
molecules that are at least about 18 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 
DNAsis Version 2.1 using default parameters. 

Another preferred flea PL2 protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 
preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably about at least 85% identical, more preferably about at least 90% 
identical, and even more preferably about at least 95% identical to a nucleic acid 
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molecule having the nucleic acid sequence SEQ ID NO: 1 877, SEQ ED NO: 1 878, SEQ ID 
NO: 1880, SEQ ID NO: 1882, and/or SEQ ID NO: 1885; also preferred are fragments (i.e. 
portions) of such proteins encoded by nucleic acid molecules that are at least about 18 
nucleotides. Percent identity as used herein is determined using the Compare function by 
5 maximum matching within the program DNAsis Version 2. 1 using default parameters. 

Another preferred flea PL3 protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 
preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably about at least 85% identical, more preferably about at least 90% 

10 identical, and even more preferably about at least 95% identical to a nucleic acid 

molecule having the nucleic acid sequence SEQ ID NO: 1887 and/or SEQ ID NO: 1890; 
also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 
molecules that are at least about 18 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 

1 5 DNAsis Version 2. 1 using default parameters. 

Another preferred flea PL4 protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 
preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably about at least 85% identical, more preferably about at least 90% 

20 identical, and even more preferably about at least 95% identical to a nucleic acid 

molecule having the nucleic acid sequence SEQ ID NO:1892, SEQ ID NO:1894, SEQ ID 
NO:1896 and/or SEQ ID NO:1899; also preferred are fragments (i.e. portions) of such 
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proteins encoded by nucleic acid molecules that are at least about 18 nucleotides. Percent 
identity as used herein is determined using the Compare function by maximum matching 
within the program DNAsis Version 2.1 using default parameters. 

Another preferred flea SVP protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 
preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably about at least 85% identical, more preferably about at least 90% 
identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:1901 and/or SEQ ID NO:1904; 
also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 
molecules that are at least about 1 8 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 
DNAsis Version 2.1 using default parameters. 

Another preferred flea VGCC protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 
more preferably at least about 75% identical, more preferably at least about 80% 
identical, more preferably about at least 85% identical, more preferably about at least 
90% identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO: 1906, SEQ ID NO: 1908, SEQ ID 
NO:1910, SEQ ID NO:1912, SEQ ID NO:1914 and/or SEQ ID NO:1917; also preferred 
are fragments (i.e. portions) of such proteins encoded by nucleic acid molecules that are 
at least about 18 nucleotides. Percent identity as used herein is determined using the 
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Compare function by maximum matching within the program DNAsis Version 2.1 using 
default parameters. 

Another preferred flea AUP protein of the present invention includes a protein 
that is encoded by a nucleic acid molecule that is preferably at least about 70% identical, 

5 more preferably at least about 75% identical, more preferably at least about 80% 

identical, more preferably about at least 85% identical, more preferably about at least 
90% identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:1919 and/or SEQ ID NO:1922; 
also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 

10 molecules that are at least about 18 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 
DNAsis Version 2.1 using default parameters. 

Another preferred flea 7B2 protein of the present invention includes a protein that 
is encoded by a nucleic acid molecule that is preferably at least about 70% identical, more 

15 preferably at least about 75% identical, more preferably at least about 80% identical, 
more preferably about at least 85% identical, more preferably about at least 90% 
identical, and even more preferably about at least 95% identical to a nucleic acid 
molecule having the nucleic acid sequence SEQ ID NO:1924, SEQ ID NO:1927 and/or 
SEQ ID NO: 1929; also preferred are fragments (i.e. portions) of such proteins encoded by 

20 nucleic acid molecules that are at least about 18 nucleotides. Percent identity as used 
herein is determined using the Compare function by maximum matching within the 
program DNAsis Version 2.1 using default parameters. 
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Another preferred flea HMT and/or HNC protein of the present invention includes 
a protein that is encoded by a nucleic acid molecule that is preferably at least about 70% 
identical more preferably at least about 75% identical, more preferably at least about 
80% identical, more preferably at least about 85% identical, more preferably at least 

5 about 90% identical, and even more preferably at least about 95% identical to a nucleic 
acid molecule having a nucleic acid sequence of Table I, Table n, Table m and/or Table 
IV; also preferred are fragments (i.e. portions) of such proteins encoded by nucleic acid 
molecules that are at least about 18 nucleotides. Percent identity as used herein is 
determined using the Compare function by maximum matching within the program 

10 DNAsis Version 2. 1 using default parameters. 

Additional preferred flea ALN proteins of the present invention include proteins 
having the amino acid sequence SEQ ED NO:2 or SEQ ID NO:5, and proteins comprising 
homologues of a protein having the amino acid sequence SEQ ID NO:2 or SEQ ID NO:5, 
wherein such a homologue comprises at least one epitope that elicits an immune response 

15 against a protein having an amino acid sequence SEQ ID NO:2 or SEQ ID NO:5. 
Likewise, also preferred are proteins encoded by nucleic acid molecules comprising 
nucleic acid sequence SEQ ID NO: 1 and/or SEQ ID NO:4, or by homologues thereof. 

Additional preferred flea CBP proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO:8 or SEQ ID NO: 1 1, and proteins 

20 comprising homologues of a protein having the amino acid sequence SEQ ID NO: 8 or 
SEQ ID NO:l 1, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO:8 or SEQ 
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ID NO: 1 1 . Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ID NO: 7 and/or SEQ ID NO: 10, or by 
homologues thereof. 

Additional preferred flea NKAB proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 14 or SEQ ID NO: 17, and proteins 
comprising homologues of a protein having the amino acid sequence SEQ ID NO: 14 or 
SEQ ID NO: 17, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO: 14 or 
SEQ ID NO: 17. Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ID NO: 13 and/or SEQ ID NO: 16, or by 
homologues thereof 

Additional preferred flea LGIC proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO:20, SEQ ID NO:23 or SEQ ID NO:1862, 
and proteins comprising homologues of a protein having the amino acid sequence SEQ 
ID NO:20, SEQ ID NO:23 or SEQ ID NO: 1862, wherein such a homologue comprises at 
least one epitope that elicits an immune response against a protein having an amino acid 
sequence SEQ ID NO:20, SEQ ID NO:23 or SEQ ID NO: 1862. Likewise, also preferred 
are proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 19, SEQ ID NO:22, SEQ ID NO: 1859, SEQ ID NO: 1861 and/or SEQ ID NO: 1864 or 
by homologues thereof. 

Additional preferred flea ANON proteins of the present invention include proteins 
having the amino acid sequence SEQ JD NO:26 or SEQ ID NO:29, and proteins 
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comprising homologues of a protein having the amino acid sequence SEQ ID NO:26 or 
SEQ ID NO:29, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO:26 or 
SEQ ID NO:29. Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ED NO:25 and/or SEQ ID NO:28, or by 
homologues thereof. 

Additional preferred flea MALV proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO:32 or SEQ ID NO:35, and proteins 
comprising homologues of a protein having the amino acid sequence SEQ ID NO:32 or 
SEQ ID NO:35, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO:32 or 
SEQ ID NO:35. Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ED NO:3 1 and/or SEQ ID NO:34, or by 
homologues thereof. 

Additional preferred flea OS-D proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO:38 or SEQ ID NO:41, and proteins 
comprising homologues of a protein having the amino acid sequence SEQ ID NO:38 or 
SEQ ID NO:41, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO:38 or 
SEQ ID NO:41. Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ID NO:37 and/or SEQ ED NO:40, or by 
homologues thereof. 
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Additional preferred fleaNMDA proteins of the present invention include 
proteins having the amino acid sequence SEQ ID NO:44 or SEQ ID NO:47, and proteins 
comprising homologues of a protein having the amino acid sequence SEQ ID NO:44 or 
SEQ ID NO:47, wherein such a homologue comprises at least one epitope that elicits an 
immune response against a protein having an amino acid sequence SEQ ID NO:44 or 
SEQ ID NO:47. Likewise, also preferred are proteins encoded by nucleic acid molecules 
comprising nucleic acid sequence SEQ ID NO:43 and/or SEQ ID NO:46, or by 
homologues thereof. 

Additional preferred flea CLBP proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 154, SEQ ID NO: 157, SEQ ID NO: 160, 
SEQ ID NO: 163, SEQ ID NO: 166 or SEQ ID NO: 169, and proteins comprising 
homologues of a protein having the amino acid sequence SEQ ID NO: 154, SEQ ID 
NO: 157, SEQ ID NO: 160, SEQ ID NO: 163, SEQ ID NO: 166 or SEQ ID NO: 169, 
wherein such a homologue comprises at least one epitope that elicits an immune response 
against a protein having an amino acid sequence SEQ ID NO: 154, SEQ ID NO: 157, SEQ 
ID NO: 160, SEQ ID NO: 163, SEQ ID NO: 166 or SEQ ID NO: 169. Likewise, also 
preferred are proteins encoded by nucleic acid molecules comprising nucleic acid 
sequence SEQ ID NO:153, SEQ ID NO:156, SEQ ID NO:159, SEQ ID NO:162, SEQ ID 
NO: 165 and/or SEQ ID NO: 168, or by homologues thereof. 

Additional preferred flea NAH proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1868, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO:1868, wherein such a 
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homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1868. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1 867 and/or SEQ ID NO: 1 870, or by homologues thereof. 

Additional preferred flea CLIC proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1873, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1 873, wherein such a 
homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1873. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1 872 and/or SEQ ID NO: 1 875, or by homologues thereof. 

Additional preferred flea PL2 proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1883, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1883, wherein such a 
homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1883. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO:1877, SEQ ID NO:1878, SEQ ID NO:1880, SEQ ID NO:1882 and/or SEQ ID 
NO: 1885, or by homologues thereof. 

Additional preferred flea PL3 proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1888, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1888, wherein such a 
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homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1888. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1887 and/or SEQ ID NO: 1890, or by homologues thereof. 

Additional preferred flea PL4 proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1897, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1897, wherein such a 
homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1897. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1892, SEQ ID NO: 1894, SEQ ED NO: 1896 and/or SEQ ID NO: 1899, or by 
homologues thereof. 

Additional preferred flea S VP proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1902, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1902, wherein such a 
homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1902. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1901 and/or SEQ ID NO: 1904, or by homologues thereof. 

Additional preferred flea VGCC proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1915, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1915, wherein such a 
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homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1915. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO: 1906, SEQ ID NO: 1908, SEQ ID NO:1910, SEQ ID NO: 1912, SEQ ID NO: 1914 
and/or SEQ ID NO: 1917, or by homologues thereof. 

Additional preferred flea AUP proteins of the present invention include proteins 
having the amino acid sequence SEQ ED NO: 1920, and proteins comprising homologues 
of a protein having the amino acid sequence SEQ ID NO: 1920, wherein such a 
homologue comprises at least one epitope that elicits an immune response against a 
protein having an amino acid sequence SEQ ID NO: 1920. Likewise, also preferred are 
proteins encoded by nucleic acid molecules comprising nucleic acid sequence SEQ ID 
NO:1919 and/or SEQ ID NO:1922, or by homologues thereof. 

Additional preferred flea 7B2 proteins of the present invention include proteins 
having the amino acid sequence SEQ ID NO: 1925 or SEQ ID NO: 1930, and proteins 
comprising homologues of a protein having the amino acid sequence SEQ ID NO: 1925 or 
SEQ ID NO: 1930, wherein such a homologue comprises at least one epitope that elicits 
an immune response against a protein having an amino acid sequence SEQ ID NO: 1925 
or SEQ ID NO: 1930. Likewise, also preferred are proteins encoded by nucleic acid 
molecules comprising nucleic acid sequence SEQ ID NO:1924, SEQ ID NO: 1927 and/or 
SEQ ID NO: 1929, or by homologues thereof. 

Additional preferred flea HMT and/or HNC proteins of the present invention 
include proteins having an amino acid sequence encoded by a nucleic acid sequence of 
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Table I, Table II, Table m and/or Table IV, and proteins comprising homologues of a 
protein encoded by a nucleic acid sequence of Table I, Table H, Table HI and/or Table 
IV, wherein such a homologue comprises at least one epitope that elicits an immune 
response against a protein encoded by a nucleic acid sequence of Table I, Table n, Table 

5 EI and/or Table IV. 

A preferred isolated protein of the present invention is a protein encoded by at 
least one of the following nucleic acid molecules: nCfALN 2057 , nCfALN 1152 , nCfCBP 1128 , 
nCfCBP 816 , nCfNKAB m4 , nCfNKAB 978 , nCfLGIC 2240 , nCfLGIC 1707 , nCfANON 1429 , 
nCfANON 1194 , nCfMALV 765 , nCfMALV 762 , nCfOSD 604 , nCfOSD 405 , nCfNMDA 1227 , 

10 nCfNMDA 73g , nCfCLBP 1 A 6335 nCfCLBP 1 A m , nCfCLBP2A 631 , nCfCLBP2A 441> 
nCfLGIC 2739 , nC£LGIC 2016 , nCfNAH 20g05 nCfNAH 1824 , nCfCLIC 22g3 , nCfCLIC 786 , 
nCfPL2, 291 , nCfPL2 1173 , nCfPL3 406 , nCfPL3 243 , nCfPL4 974 , nCfPL4 l043 , nCfPL4 1062 , 
nCfPL4 855 , nCfSVP I875 , nCfSVP 1590 , nCfVGCC 38l , nCfVGCC 2191 , nCfVGCC 1968 , 
nCfVGCC 673 , nCfVGCC 3126 , nCfVGCC 2553 , nCfAUP 1181 , nCfAUP 306 , nCf7B2 2161 , 

15 nCf7B2 801 , nCf7B2 74 j or allelic variants of any of these nucleic acid molecules. Another 
preferred isolated protein is encoded by a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO: 10, SEQ ID NO: 13, 
SEQ ID NO: 16, SEQ ID NO: 19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28, SEQ 
ID NO:31, SEQ ID NO:34, SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:43, SEQ ID 

20 NO:46, SEQ ID NO:153, SEQ ID NO: 156, SEQ ID NO: 159, SEQ ID NO: 162, SEQ ID 
NO:165, SEQ ID NO: 168, SEQ ID NO:1859, SEQ ID NO:1861, SEQ ID NO:1864, SEQ 
ID NO:1867, SEQ ID NO:1870, SEQ ID NO:1872, SEQ ID NO:1875, SEQ ED NO:1877, 
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SEQ ID NO:1878, SEQ ID NO:1881, SEQ ID NO:1882, SEQ ID NO: 1885, SEQ ID 
NO: 1887, SEQ ID NO: 1890, SEQ ID NO: 1892, SEQ ID NO: 1894, SEQ ED NO: 1896, 
SEQ ID NO: 1899, SEQ ED NO: 1901, SEQ ED NO:1904, SEQ ED NO: 1906, SEQ ED 
NO: 1908, SEQ ED NO: 1910, SEQ ED NO: 1912, SEQ ED NO: 1914, SEQ ED NO: 19 17, 
SEQ ED NO: 19 19, SEQ ED NO: 1922, SEQ ED NO: 1924, SEQ ID NO: 1927, and/or SEQ 
ED NO: 1929; or a protein encoded by an allelic variant of any of these listed nucleic acid 
molecules. 

Preferred proteins of the present invention include proteins that are at least about 
70%, preferably at least about 80%, more preferably at least about 85%, even more 
preferably at least about 90%, even more preferably at least about 95%, and even more 
preferably about 100% identical to PCfALN 3g4 , PCfCBP 272 , PCfNKAB 326 , PCfLGIC 569 , 
PCfANON 39g , PCfMALV 2J4 , PCfOSD, 35 , PCfNMDA 246 , PCfCLBPlA 147 or 
PCfCLBP2A I47 . Additionally preferred are proteins encoded by allelic variants of a 
nucleic acid molecules encoding proteins PCfALN 384 , PCfCBP 272 , PCfNKAB 326 , 
PCfLGIC 569 , PCfANON 39g , PCfMALV 2S4 , PCfOSD 135 , PCfNMDA 246 , PCfCLBPlA 147 , 
PCfCLBP2A 147 PCfLGIC 672 , PCfNAH 60g , PCfCLIC 262 , PCfPL2 391) PCfPL3 8I , PCfPL4 2g5 , 
PCfSVP 530 , PCfVGCC gs „ PCfAUP 102 , PCf7B2 267 , PCf7B2 247 . Also preferred are 
fragments thereof having at least about 6 amino acid residues. 

Other preferred HMT and HNC proteins of the present invention include proteins 
having amino acid sequences that are at least about 70%, preferably at least about 80%, 
more preferably at least about 85%, even more preferably at least about 90%, even more 
preferably at least about 95%, and even more preferably about 100% identical to amino 
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acid sequence SEQ ID N0:2, SEQ ID N0:8, SEQ ID NO: 14, SEQ ID NO:20, SEQ ID 
NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO:154, SEQ ID 
NO: 160, SEQ ID NO: 163, SEQ ID NO: 169, SEQ ID NO: 1862, SEQ ID NO: 1868, SEQ 
ID NO:1873, SEQ ID NO:1879, SEQ ID NO:1883, SEQ ID NO:1888, SEQ ID NO:1897, 
5 SEQ ID NO:1902, SEQ ID NO:1915, SEQ ID NO:1920, SEQ ID NO:1925, and/or SEQ 
ID NO: 1930; and proteins encoded by allelic variants of nucleic acid molecules encoding 
HMT and HNC proteins having amino acid sequences SEQ ID NO:2, SEQ ID NO:8, 
i SEQ ID NO:14, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:32, SEQ ED NO:38, SEQ 

p ID NO:44, SEQ ID NO-.154, SEQ ID NO:160, SEQ ID NO:163, SEQ ID NO:169, SEQ 

P. 10 IDNO:1862, SEQ ID NO:1868, SEQIDNO:1873, SEQIDNO:1879, SEQ ID NO: 1883, 

yy 

f SEQ ID NO: 1888, SEQ ID NO: 1897, SEQ ID NO: 1902, SEQ IDNO:1915, SEQ ID 

M, NO: 1920, SEQ ID NO: 1925, and/or SEQ ID NO: 1930. Also preferred are fragments 

IIJ 

M= thereof having at least about 6 amino acid residues. 

w 

^ In one embodiment of the present invention, C. felis HMT and HNC proteins 

15 comprise amino acid sequence SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:14, SEQ ID 
NO:20, SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID 
NO:154, SEQ ID NO:160, SEQ ID NO:163, SEQ ID NO:169, SEQ ID NO:1862, SEQ 
ID NO:1868, SEQ ID NO: 1873, SEQ ID NO:1879, SEQ ID NO:1883, SEQ ID NO: 1888, 
SEQ ID NO: 1897, SEQ ID NO: 1902, SEQ ID NO: 191 5, SEQ ID NO: 1920, SEQ ID 
20 NO: 1 925, and/or SEQ ID NO: 1930 (including, but not limited to, the proteins consisting 
of amino acid sequence SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO: 14, SEQ ID NO:20, 
SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO:154, SEQ 
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ID NO: 160, SEQ ID NO: 163, SEQ ID NO: 169, SEQ ID NO: 1862, SEQ ID NO: 1868, 
SEQ ID NO: 1873, SEQ ID NO: 1879, SEQ ID NO: 1883, SEQ ID NO: 1888, SEQ ID 
NO:1897, SEQ ID NO:1902, SEQ ID NO:1915, SEQ ID NO:1920, SEQ ID NO:1925, 
and/or SEQ ID NO: 1930, fusion proteins and multivalent proteins), and proteins encoded 
5 by allelic variants of nucleic acid molecules encoding proteins having amino acid 

sequence SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO: 14, SEQ ID NO:20, SEQ ID NO:26, 
SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO:154, SEQ ID NO:160, 
J SEQ ID NO: 163, SEQ ID NO: 169, SEQ ID NO: 1 862, SEQ ID NO: 1868, SEQ ID 

rJLJ: 

5 NO:1873, SEQ ID NO:1879, SEQ ID N0.1883, SEQ ID NO:1888, SEQ ID NO:1897, 

10 SEQIDNO:1902,SEQIDNO:1915,SEQIDNO:1920,SEQIDNO:1925,and/orSEQ 

0i ID NO: 1930. 

s 

U Li one embodiment, a preferred flea HMT or HNC protein comprises an amino 

etc : 
J - : 
: tsF 

M= acid sequence of at least about 35 amino acids, preferably at least about 50 amino acids, 

H 8 more preferably at least about 100 amino acids, more preferably at least about 200 amino 

15 acids, more preferably at least about 250 amino acids, more preferably at least about 300 
amino acids, more preferably at least about 350 amino acids, more preferably at least 
about 400 amino acids, more preferably at least about 450 amino acids, more preferably 
at least about 500 amino acids, even more preferably at least about 550 amino acids, and 
even more preferably at least about 575 amino acids. In another embodiment, preferred 
20 flea HMT and HNC proteins comprise full-length proteins, i.e., proteins encoded by full- 
length coding regions, or post-translationally modified proteins thereof, such as mature 
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proteins from which initiating methionine and/or signal sequences or "pro" sequences 
have been removed. 

A fragment of an HMT and/or HNC protein of the present invention preferably 
comprises at least about 5 amino acids, more preferably at least about 10 amino acids, 
more preferably at least about 15 amino acids, more preferably at least about 20 amino 
acids, more preferably at least about 25 amino acids, more preferably at least about 30 
amino acids, more preferably at least about 35 amino acids, more preferably at least about 
40 amino acids, more preferably at least about 45 amino acids, more preferably at least 
about 50 amino acids, more preferably at least about 55 amino acids, more preferably at 
least about 60 amino acids, more preferably at least about 65 amino acids, more 
preferably at least about 70 amino acids, more preferably at least about 75 amino acids, 
more preferably at least about 80 amino acids, more preferably at least about 85 amino 
acids, more preferably at least about 90 amino acids, more preferably at least about 95 
amino acids, and even more preferably at least about 100 amino acids in length. 

Additional preferred HMT and HNC proteins of the present invention include 
proteins encoded by nucleic acid molecules comprising at least a portion of nC£ALN 2057 , 
nCfALN U52 , nCfCBP 1128 , nCfCBP 816 , nCfNKAB 1714 , nCfNKAB 978 , nCfiLGIC 2240 , 
nCfLGIC 1707 , nCfANON 1429 , nCfANON 1194 , nCfMALV 765 , nCfMALV 762 , nCfOSD 604 , 
nCfOSD 405 , nCfNMDA 1227 , nCfNMDA 738 , nCfCLBPlA 633 , nCfCLBPlA 441 , 
nCfCLBP2A 631 , nCfCLBP2A 441 , nCfLGIC 2739 , nCfLGIC 2016 , nCfNAH 2080 , nCfNAH 1824 , 
nCfCLIC 2283 , nCfCLIC 786 , nCfPL2 1291 , nCfPL2 1173 , nC£PL3 406 , nCfPL3 243 , nCfPL4 974 , 
nCfPL4 1043 , nCfPL4 1062 , nCfPL4 855 , nCfSVP 1875 , nCfSVP 1590 , nCfVGCC 381 , nCfVGCC 2191 , 
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nCfVGCC 1968 , nCrVGCC 673 , nCfVGCC 3126 , nCfVGCC 2S53 , nCfAUP 1181 , nCfAUP 306 , 
nCf7B2 2161 , nCf7B2 g01 , nCf7B2 741 as well as HMT and HNC proteins encoded by allelic 
variants of such nucleic acid molecules. A portion of such HMT and HNC nucleic acid 
molecule is preferably at least 18 nucleotides in length. 

5 Also preferred are HMT and HNC proteins encoded by nucleic acid molecules 

having nucleic acid sequences comprising at least a portion of SEQ ID NO:l, SEQ ID 
NO:4, SEQ ID NO:7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 16, SEQ ID NO: 19, 
SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:3 1, SEQ ID NO:34, SEQ 
ID NO:37, SEQ ID NO:40, SEQ ID NO:43, SEQ ID NO:46, SEQ ID NO:153, SEQ ID 

10 NO:156, SEQ ID NO:159, SEQ ID NO:162, SEQ ED NO:165, SEQ ID NO:168, SEQ ID 
NO:1859, SEQ ID NO:1861, SEQ ID NO:1864, SEQ ID NO:1867, SEQ ID NO:1870, 
SEQ ID NO: 1872, SEQ ID NO: 1875, SEQ ID NO: 1877, SEQ ID NO: 1878, SEQ ID 
NO: 1881, SEQ ID NO: 1882, SEQ ID NO: 1885, SEQ ID NO: 1887, SEQ ID NO: 1890, 
SEQ ID NO:1892, SEQ ID NO:1894, SEQ ID NO: 1896, SEQ ID NO:1899, SEQ ID 

15 NO: 1901, SEQ ID NO: 1904, SEQ ID NO: 1906, SEQ ID NO: 1908, SEQ ID NO: 1910, 
SEQ ID NO:1912, SEQ ID NO:1914, SEQ ID NO: 19 17, SEQ ID NO:1919, SEQ ID 
NO: 1922, SEQ ID NO: 1924, SEQ ID NO: 1927, and/or SEQ ID NO: 1929, as well as 
allelic variants of these nucleic acid molecules. A portion of such HMT and HNC nucleic 
acid molecule is preferably at least 1 8 nucleotides in length. 

20 In another embodiment, a preferred flea HMT and/or HNC protein of the present 

invention is encoded by a nucleic acid molecule comprising at least about 15 nucleotides, 
more preferably at least about 18 nucleotides, more preferably at least about 20 
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nucleotides, more preferably at least about 25 nucleotides, more preferably at least about 
30 nucleotides, more preferably at least about 40 nucleotides, more preferably at least 
about 50 nucleotides, more preferably at least about 100 nucleotides, more preferably at 
least about 150 nucleotides, more preferably at least about 350 nucleotides, more 

5 preferably at least about 450 nucleotides, more preferably at least about 550 nucleotides, 
more preferably at least about 650 nucleotides, more preferably at least about 750 
nucleotides, more preferably at least about 1000 nucleotides, more preferably at least 
about 1500 nucleotides, more preferably at least about 1750 nucleotides more preferably 
at least about 2000 nucleotides, and even more preferably at least about 2250 nucleotides 

10 in length. Within this embodiment is a HMT protein encoded by at least a portion of 
nCfALN 2057 , nCfALN U52 , nCfCBP 1I28 , nCfCBP 816 , nCfNKAB I7143 nCfNKAB 978 
nCfLGIC 2240 , nCfLGIC l707 , nCfANON 1429 , nCfANON 1194 , nCfMALV 765 , nCfMALV 762 , 
nCfOSD 604 , nCfOSD 405 , nCfNMDA l227 , nCfNMDA 738 , nCfCLBPlA 633 , nCfCLBPlA 441 , 
nCfCLBP2A 631 , nCfCLBP2A 441 , nCfLGIC 2739 , nCfLGIC 20I6 , nCfNAH 2080 , nCfNAH 1824 , 

15 nCfCLIC 2283 , nCfCLIC 786 , nCfPL2 1291 , nCfPL2 U73 , nCfPL3 406 , nCfPL3 243 , nCfPL4 974 , 

nCfPL4 1043 , nCfPL4 1062 , nCfPL4 855 , nCfSVP 1875 , nCfSVP 1590 , nCfVGCC 381 , nCfVGCC 2191 , 
nCfVGCC 1968 , nCfVGCC 673 , nCfVGCC 3126 , nCfVGCC 2553 , nCfAUP 1181 , nC£AUP 306 , 
nCf7B2 2161 , nCf7B2 80l5 nCf7B2 741 or by an allelic variant of any of these nucleic acid 
molecules. In yet another embodiment, preferred flea HMT and HNC proteins of the 

20 present invention are encoded by nucleic acid molecules comprising apparently full- 
length HMT or HNC coding regions respectively, i.e., nucleic acid molecules encoding an 
apparently full-length HMT or HNC proteins. 
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Preferred flea HMT and HNC proteins of the present invention can be used to 
develop inhibitors that, when administered to an animal in an effective manner, are 
capable of protecting that animal from flea infestation. In accordance with the present 
invention, the ability of an inhibitor of the present invention to protect an animal from 
flea infestation refers to the ability of that protein to, for example, treat, ameliorate and/or 
prevent infestation caused by fleas. In particular, the phrase "to protect an animal from 
flea infestation" refers to reducing the potential for flea population expansion on and 
around the animal (i.e., reducing the flea burden). Preferably, the flea population size is 
decreased, optimally to an extent that the animal is no longer bothered by fleas. A host 
animal, as used herein, is an animal from which fleas can feed by attaching to and feeding 
through the skin of the animal. Fleas, and other ectoparasites, can live on a host animal 
for an extended period of time or can attach temporarily to an animal in order to feed. At 
any given time, a certain percentage of a flea population can be on a host animal whereas 
the remainder can be in the environment of the animal. Such an environment can include 
not only adult fleas, but also flea eggs and/or flea larvae. The environment can be of any 
size such that fleas in the environment are able to jump onto and off of a host animal. For 
example, the environment of an animal can include plants, such as crops, from which 
fleas infest an animal. As such, it is desirable not only to reduce the flea burden on an 
animal per se, but also to reduce the flea burden in the environment of the animal. 

Suitable fleas to target include any flea that is essentially incapable of causing 
disease in an animal administered an inhibitor of the present invention. As such, fleas to 
target include any flea that produces a protein that can be targeted by an inhibitory 
compound that inhibits a flea HMT or HNC protein function, thereby resulting in the 
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decreased ability of the parasite to cause disease in an animal. Preferred fleas to target 
include fleas of the following genera: Ctenocephalides, Cyopsyllus, Diamanus 
(Oropsylla), Echidnophaga, Nosopsyllus, Pulex, Tunga, and Xenopsylla, with those of the 
species Ctenocephalides canis, Ctenocephalides felis, Diamanus montanus, 

5 Echidnophaga gallinacea, Nosopsyllus faciatus, Pulex irritans, Pulex simulans, Tunga 
penetrans and Xenopsylla cheopis being more preferred, with C. felis being even more 
preferred. Such fleas are also preferred for the isolation of proteins or nucleic acid 
molecules of the present invention. 

One embodiment of a flea HMT and/or HNC protein of the present invention is a 

10 fusion protein that includes a flea HMT and/or HNC protein-containing domain attached 
to one or more fusion segments. Suitable fusion segments for use with the present 
invention include, but are not limited to, segments that can: enhance a protein's stability; 
act as an immunopotentiator to enhance an immune response against a flea HMT and/or 
HNC protein; and/or assist in purification of a flea HMT and/or HNC protein (e.g., by 

15 affinity chromatography). A suitable fusion segment can be a domain of any size that has 
the desired function (e.g., imparts increased stability, imparts increased immunogenicity 
to a protein, and/or simplifies purification of a protein). Fusion segments can be joined to 
amino and/or carboxyl termini of the flea HMT-containing and/or HNC-containing 
domain of the protein and can be susceptible to cleavage in order to enable straight- 

20 forward recovery of a flea HMT and/or HNC protein. Fusion proteins are preferably 

produced by culturing a recombinant cell transformed with a fusion nucleic acid molecule 
that encodes a protein including the fusion segment attached to either the carboxyl and/or 
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amino terminal end of an HMT-containing and/or HNC-containing domain. Preferred 
fusion segments include a metal binding domain (e.g., a poly-histidine segment); an 
immunoglobulin binding domain (e.g., Protein A; Protein G; T cell; B cell; Fc receptor or 
complement protein antibody-binding domains); a sugar binding domain (e.g., a maltose 

5 binding domain); and/or a "tag" domain (e.g., at least a portion of (3-galactosidase, a strep 
tag peptide, a T7 tag peptide, a Flag™ peptide, or other domains that can be purified 
using compounds that bind to the domain, such as monoclonal antibodies). More 
preferred fusion segments include metal binding domains, such as a poly-histidine 
segment; a maltose binding domain; a strep tag peptide, such as that available from 

10 Biometra in Tampa, FL; and an S10 peptide. 

The present invention also includes mimetopes of flea HMT and/or HNC proteins 
of the present invention. As used herein, a mimetope of a flea HMT and/or HNC protein 
of the present invention refers to any compound that is able to mimic the activity of such 
an HMT and/or HNC protein, often because the mimetope has a structure that mimics the 

15 particular HMT and/or HNC protein. Mimetopes can be, but are not limited to: peptides 
that have been modified to decrease their susceptibility to degradation such as all-D retro 
peptides; anti-idiotypic and/or catalytic antibodies, or fragments thereof; non- 
proteinaceous immunogenic portions of an isolated protein (e.g., carbohydrate structures); 
and synthetic or natural organic molecules, including nucleic acids. Such mimetopes can 

20 be designed using computer-generated structures of proteins of the present invention. 
Mimetopes can also be obtained by generating random samples of molecules, such as 
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oligonucleotides, peptides or other organic molecules, and screening such samples by 
affinity chromatography techniques using the corresponding binding partner. 

Another embodiment of the present invention is an isolated nucleic acid molecule 
comprising a flea HMT and/or HNC nucleic acid molecule, i.e. a nucleic acid molecule 
that can be isolated from a HMT cDNA library, from a HNC cDNA library, or from both 
libraries. As used herein, HMT and HNC nucleic acid molecules has the same meaning 
as HMT and/or HNC nucleic acid molecule. The identifying characteristics of such 
nucleic acid molecules are heretofore described. A nucleic acid molecule of the present 
invention can include an isolated natural flea HMT and/or HNC gene or a homologue 
thereof, the latter of which is described in more detail below. A nucleic acid molecule of 
the present invention can include one or more regulatory regions, full-length or partial 
coding regions, or combinations thereof. The minimal size of a nucleic acid molecule of 
the present invention is a size sufficient to allow the formation of a stable hybrid (i.e., 
hybridization under stringent hybridization conditions) with the complementary sequence 
of another nucleic acid molecule. As such, the minimal size of a HMT and/or HNC 
nucleic acid molecule of the present invention is from about 12 to about 18 nucleotides in 
length. Suitable and preferred fleas from which to isolate nucleic acid molecules of the 
present invention are disclosed herein. Particularly preferred HMT and/or HNC nucleic 
acid molecules include C. felis HMT and/or HNC nucleic acid molecules. 

In accordance with the present invention, an isolated nucleic acid molecule is a 
nucleic acid molecule that has been removed from its natural milieu (i.e., that has been 
subjected to human manipulation) and can include DNA, RNA, or derivatives of either 
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DNA or RNA. As such, "isolated" does not reflect the extent to which the nucleic acid 
molecule has been purified. Isolated flea HMT and/or HNC nucleic acid molecules of the 
present invention, or homologues thereof, can be isolated from a natural source or 
produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) 

5 amplification or cloning) or chemical synthesis. Isolated flea HMT and/or HNC nucleic 
acid molecules, and homologues thereof, can include, for example, natural allelic variants 
and nucleic acid molecules modified by nucleotide insertions, deletions, substitutions, 
and/or inversions in a manner such that the modifications do not substantially interfere 
with the nucleic acid molecule's ability to encode a HMT and/or HNC protein of the 

1 0 present invention. 

A flea HMT and/or HNC nucleic acid molecule homologue can be produced using 
a number of methods known to those skilled in the art, see, for example, Sambrook et al, 
ibid., is incorporated by reference herein in its entirety. For example, nucleic acid 
molecules can be modified using a variety of techniques including, but not limited to, 

15 classic mutagenesis and recombinant DNA techniques such as site-directed mutagenesis, 
chemical treatment, restriction enzyme cleavage, ligation of nucleic acid fragments, PCR 
amplification, synthesis of oligonucleotide mixtures and ligation of mixture groups to 
"build" a mixture of nucleic acid molecules, and combinations thereof. Nucleic acid 
molecule homologues can be selected by hybridization with flea HMT and/or HNC 

20 nucleic acid molecules or by screening the function of a protein encoded by the nucleic 
acid molecule (e.g., ability to elicit an immune response against at least one epitope of a 
flea HMT or HNC protein or to effect HMT or HNC activity). 
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An isolated nucleic acid molecule of the present invention can include a nucleic 
acid sequence that encodes at least one flea HMT or HNC protein of the present 
invention, examples of such proteins being disclosed herein. Although the phrase 
"nucleic acid molecule" primarily refers to the physical nucleic acid molecule and the 

5 phrase "nucleic acid sequence" primarily refers to the sequence of nucleotides on the 
nucleic acid molecule, the two phrases can be used interchangeably, especially with 
respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding 
a flea HMT or HNC protein. 

A preferred nucleic acid molecule of the present invention, when administered to 

10 an animal, is capable of protecting that animal from flea infestation. As will be disclosed 
in more detail below, such a nucleic acid molecule can be, or encode, an antisense RNA, 
a molecule capable of triple helix formation, a ribozyme, or other nucleic acid-based drug 
compound. In additional embodiments, a nucleic acid molecule of the present invention 
can encode a protective protein (e.g., an HMT or HNC protein of the present invention), 

15 the nucleic acid molecule being delivered to the animal, for example, by direct injection 
(i.e, as a genetic vaccine) or in a vehicle such as a recombinant virus vaccine or a 
recombinant cell vaccine. 

In one embodiment of the present invention, a preferred flea HMT and/or HNC 
nucleic acid molecule includes an isolated nucleic acid molecule that hybridizes under 

20 conditions that preferably allow less than or equal to about 30% base pair mismatch, more 
preferably under conditions that allow less than or equal to about 25% base pair 
mismatch, more preferably under conditions that allow less than or equal to about 20% 
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base pair mismatch, more preferably under conditions that allow less than or equal to 
about 15% base pair mismatch, more preferably under conditions that allow less than or 
equal to about 10% base pair mismatch and even more preferably under conditions that 
allow less than or equal to about 5% base pair mismatch with a nucleic acid molecule 

5 selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:21, SEQ 
ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 

10 NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:43, SEQ ED 
NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID 
NO:156, SEQIDNO:158, SEQIDNO:159, SEQIDNO:161, SEQIDNO:162, SEQ ID 
NO: 164, SEQ ID NO: 165, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 170, SEQ ID 
NO:1859, SEQ ID NO:1860, SEQ ID NO:1861, SEQ ID NO:1863, SEQ ID NO:1864, 

15 SEQ ID NO: 1866, SEQ ID NO: 1867, SEQ ID NO: 1869, SEQ ID NO: 1870, SEQ ID 
NO:1871, SEQ ID NO:1872, SEQ ID NO:1874, SEQ ID NO: 1875, SEQ ID NO:1876, 
SEQ ID NO: 1877, SEQ ID NO: 1878, SEQ ID NO: 1880, SEQ ID NO: 1881, SEQ ID 
NO: 1882, SEQ ID NO: 1884, SEQ ID NO: 1885, SEQ ID NO: 1886, SEQ ID NO: 1887, 
SEQ ID NO: 1889, SEQ ID NO: 1890, SEQ ID NO: 1891, SEQ ID NO: 1892, SEQ ID 

20 NO: 1893, SEQ ID NO:1894, SEQ ID NO:1895, SEQ ID NO: 1896, SEQ ID NO: 1898, 
SEQ ID NO:1899, SEQ ID NO:1900, SEQ ID NO:1901, SEQ ID NO:1903, SEQ ID 
NO:1904, SEQ ID NO: 1905, SEQIDNO:1906, SEQ ID NO: 1907, SEQ ID NO: 1908, 
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SEQ ID NO : 1 909, SEQ ID NO: 1 9 1 0, SEQ ID NO : 1 9 1 1 , SEQ ID NO: 1 9 1 2, SEQ ID 
NO:1913, SEQ ID NO:1914, SEQ ID NO:1916, SEQ ID NO:1917, SEQ ID NO:1918, 
SEQ ID NO:1919, SEQ ID NO: 1921, SEQ ID NO:1922, SEQ ID NO:1923, SEQ ID 
NO: 1924, SEQ ID NO: 1926, SEQ ID NO: 1927, SEQ ID NO: 1928, SEQ ID NO: 1929, 

5 and/or SEQ ID NO : 1 93 1 , a nucleic acid molecule of Table I, Table n, Table HI or Table 
IV and/or a nucleic acid molecule that is complementary to a nucleic acid molecule of 
Table I, Table n, Table m or Table IV. 

Another embodiment of the present invention includes a HMT and/or HNC 
nucleic acid molecule, wherein said nucleic acid molecule hybridizes, in a solution 

10 comprising IX SSC and 0% formamide, at a temperature of about 47.5°C, to an isolated 
nucleic acid molecule selected from the group consisting of SEQ ID NO: 1, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ 
ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 

15 NO:27, SEQ IDNO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO: 153, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID 
NO:161, SEQ ID NO:162, SEQ ID NO: 164, SEQ ID NO:165, SEQ ID NO:167, SEQ ID 

20 NO:168, SEQ ID NO:170, SEQ ID NO:1859, SEQ ID NO:1860, SEQ ID NO:1861, SEQ 
ID NO:1863, SEQ ID NO:1864, SEQ ID NO:1866, SEQ ID NO:1867, SEQ ID NO:1869, 
SEQ ID NO: 1 870, SEQ ID NO: 187 1, SEQ ID NO: 1 872, SEQ ID NO: 1 874, SEQ ID 
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NO: 1 875, SEQ ID NO: 1 876, SEQ ID NO: 1 877, SEQ ID NO: 1 878, SEQ ID NO: 1 880, 
SEQ ID NO:1881, SEQ ID NO:1882, SEQ ID NO:1884, SEQ ID NO:1885, SEQ ID 
NO: 1886, SEQ ID NO: 1887, SEQ ID NO: 1889, SEQ ID NO: 1890, SEQ ID NO: 1891, 
SEQ ID N0.1892, SEQ ID NO:1893, SEQ ID NO:1894, SEQ ID NO:1895, SEQ ID 
5 NO: 1896, SEQ ID NO: 1 898, SEQ ID NO:l 899, SEQ ID NO: 1900, SEQ ID NO: 1901, 
SEQ ID NO: 1903, SEQ ID NO: 1904, SEQ ID NO:1905, SEQ ED NO:1906, SEQ ID 
NO:1907, SEQ ID NO:1908, SEQ ID NO:1909, SEQ ID NO:1910, SEQ ID NO:1911, 

0 

S SEQ ID NO: 1912, SEQ ID NO: 1913, SEQ ID NO: 1914, SEQ IDNO:1916, SEQ ID 

*fi NO:1917, SEQ ID NO:1918, SEQ ID NO:1919, SEQ ID NO:1921, SEQ ID NO:1922, 

10 SEQ ID NO: 1923, SEQ ID NO: 1924, SEQ ID NO: 1926, SEQ IDNO:1927, SEQ ID 
I ' NO: 1928, SEQ ID NO: 1929, and/or SEQ ID NO: 193 1, a nucleic acid molecule of 

i s 

M Table I, Table E, Table IE or Table IV and/or a nucleic acid molecule that is 

Hi 

H- complementary to a nucleic acid molecule of Table I, Table II, Table EI or Table IV. 

: : 

^ Additional preferred nucleic acid molecules of the present invention include 

15 oligonucleotides of an isolated nucleic acid molecule, wherein said nucleic acid molecule 
hybridizes, in a solution comprising IX SSC and 0% formamide, at a temperature of 
about 47.5°C, to an isolated nucleic acid molecule selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, 
20 SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ 
ID NO:25, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
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NO:40, SEQ ID NO:42, SEQ ID NO:43 3 SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO: 153, SEQ ID NO: 155, SEQ ED NO: 156, SEQ ID NO: 158, SEQ ED 
NO: 159, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ID NO: 164, SEQ ED NO: 165, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ED NO: 170, SEQ ID NO: 1859, SEQ ID NO: I860, SEQ 

5 ID NO: 1861, SEQ ED NO: 1863, SEQ ED NO: 1864, SEQ ID NO: 1866, SEQ ID NO: 1867, 
SEQ ID NO: 1869, SEQ ID NO: 1870, SEQ ID NO: 1871, SEQ ED NO:1872, SEQ ED 
NO: 1874, SEQ ID NO:1875, SEQIDNO:1876, SEQ ED NO: 1877, SEQ ED NO: 1878, 
SEQ ID NO: 1880, SEQ ID NO: 1881, SEQ ID NO: 1882, SEQ ID NO: 1884, SEQ ID 
NO: 1885, SEQ ID NO: 1886, SEQ ID NO: 1 887, SEQ ID NO: 1889, SEQ ID NO: 1 890, 

10 SEQ ID NO:1891, SEQ ED NO:1892, SEQ ED NO:1893, SEQ ID NO:1894, SEQ ID 
NO: 1895, SEQ ED NO:1896, SEQ ED NO:1898, SEQ ED NO:1899, SEQ ID NO:1900, 
SEQ ED NO: 1901, SEQ ID NO:1903, SEQ ID NO: 1904, SEQ ID NO: 1905, SEQ ID 
NO:1906, SEQ ID NO:1907, SEQ ID NO:1908, SEQ ED NO:1909, SEQ ID NO:1910, 
SEQEDNO:1911,SEQIDNO:1912, SEQ ED NO: 1913, SEQ ID NO: 1914, SEQ ID 

15 NO:1916, SEQ ID NO:1917, SEQ ED NO:1918, SEQ ID NO:1919, SEQ ID NO:1921, 
SEQ ED NO: 1922, SEQ ID NO: 1923, SEQ ED NO: 1924, SEQ ID NO: 1926, SEQ ED 
NO: 1927, SEQ ED NO: 1928, SEQ ID NO: 1929, and/or SEQ ID NO: 1931, a nucleic acid 
molecule of Table I, Table II, Table DI or Table IV and/or a nucleic acid molecule that is 
complementary to a nucleic acid molecule of Table I, Table II, Table HI or Table IV, 

20 wherein said oligonucleotide comprises at least about 18 nucleotides. 

Additional preferred flea HMT and/or HNC nucleic acid molecules of the present 
invention include nucleic acid molecules comprising a nucleic acid sequence that is 
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preferably at least about 70%, more preferably at least about 75%, more preferably at 
least about 80% more preferably at least about 85%, more preferably at least about 90%, 
and even more preferably at least about 95% identical to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO: 1, SEQ ED NO:3, SEQ ID NO:4, SEQ ID NO:6, 

5 SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ED 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:3 1, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ED 
NO:37, SEQ ID NO:39, SEQ ED NO:40, SEQ ID NO:42, SEQ ID NO:43, SEQ ID 

10 NO:45, SEQ ID NO:46, SEQ ED NO:48, SEQ ED NO: 153, SEQ ED NO: 155, SEQ ID 
NO: 156, SEQ ED NO: 158, SEQ ID NO: 159, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ED 
NO: 164, SEQ ED NO: 165, SEQ ID NO: 167, SEQ ID NO:168, SEQ ID NO: 170, SEQ ID 
NO:1859, SEQ ID NO:1860, SEQ ID NO:1861, SEQ ID NO:1863, SEQ ID NO:1864, 
SEQ ID NO: 1 866, SEQ ID NO : 1 867, SEQ ID NO: 1 869, SEQ ID NO: 1 870, SEQ ED 

15 NO:1871, SEQ ID NO:1872, SEQ ID NO:1874, SEQ ID NO:1875, SEQ ID NO:1876, 
SEQ ID NO: 1877, SEQ ID NO: 1878, SEQ ID NO: 1880, SEQ ID NO: 1881, SEQ ED 
NO: 1882, SEQ ID NO: 1884, SEQ ID NO: 1885, SEQ ED NO: 1886, SEQ ID NO: 1887, 
SEQ ID NO: 1889, SEQ ID NO: 1890, SEQ ID NO: 1891, SEQ ID NO: 1892, SEQ ID 
NO:1893, SEQ ID NO:1894, SEQ ID NO:1895, SEQ ED NO:1896, SEQ ID NO:1898, 

20 SEQ ED NO:1899, SEQ ID NO:1900, SEQ ID NO:1901, SEQ ID NO:1903, SEQ ED 
NO: 1904, SEQ ID NO: 1905, SEQ ED NO: 1906, SEQ ID NO: 1907, SEQ ID NO: 1908, 
SEQ ED NO:1909, SEQ IDNO:1910, SEQ ID NO: 1911, SEQ ID NO:1912, SEQ ED 
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NO:1913, SEQ ID NO:1914, SEQ ID NO:1916, SEQ ID NO:1917, SEQ ID NO:1918, 
SEQ ID NO:1919, SEQ ID NO:1921, SEQ ID NO:1922, SEQ ID NO: 1923, SEQ ID 
NO:1924, SEQ ID NO:1926, SEQ ID NO:1927, SEQ ID NO:1928, SEQ ID NO:1929, 
and/or SEQ ID NO : 1 93 1 , a nucleic acid molecule of Table I, Table n, Table m or Table 

5 IV and/or a nucleic acid molecule that is complementary to a nucleic acid molecule of 
Table I, Table II, Table III or Table IV. Also preferred are oligonucleotides of any of 
such nucleic acid molecules. Percent identity may be determined using the GCG™ 
Wisconsin Package Version 9.0 sequence analysis software, using default parameters. 
One embodiment of the present invention is a nucleic acid molecule comprising 

10 all or part of nucleic acid molecules nCfALN 2057 , nCfALN, , 52 , nCfCBP mg , nCfCBP gl6 , 
nCfNKAB, 714 , nCfNKAB 97g nCfLGIC 2240 , nCfLGIC l707 , nCfANON I429 , nCfANON„ 94 , 
nCfMALV 765 , nCfMALV 762 , nCfOSD 604 , nCfOSD 405 , nCfNMDA 1227 , nCfNMDA 738 , 
nCfCLBPlA 633 , nCfCLBPl A^,, nC£CLBP2A 631 , nCfCLBP2A441, nCfLGIC 2739 , 
nCfLGIC 2016 , nCfNAH 20g0 , nCfNAH lg24 , nC£CLIC 22g3 , nCfCLIC 7g6 , nC£PL2 1291 , 

15 nCfPL2 1173 , nCfPL34o 6 , nCfPL3 243) nCfPL4 974 , nCfPL4 1043 , nCfPL4 I062 , nCfPL4 g55 , 
nCfSVP 1875 , nCfSVP 1590 , nCfVGCC 381 , nCfVGCC 2191 , nCfVGCC 196g , nCfVGCC 673 , 
nCfVGCC 3126 , nCfVGCC 2J53 , nCfAUP llg „ nCfAUP 306 , nCf7B2 2161 , nCf7B2 g01 , nCf7B2 74 , 
or allelic variants of these nucleic acid molecules. Another preferred nucleic acid 
molecule of the present invention includes at least a portion of nucleic acid sequence SEQ 

20 ID NO:l, SEQ ID NO:3, SEQ ED NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ED NO:15, SEQ ED NO:16, SEQ 
ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID 



-119- 



FC-6-C1 



NO:25, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID N0:31, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 158, SEQ ID 
5 NO: 159, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 170, SEQ ID NO: 1859, SEQ ID NO: 1860, SEQ 
ED NO: 1861, SEQ ID NO:1863, SEQ ID NO:1864, SEQ ID NO:1866, SEQ ID NO:1867, 
J5 SEQ ED NO: 1869, SEQ ID NO: 1870, SEQ ID NO: 1871, SEQ ID NO: 1872, SEQ ID 

tfl NO: 1874, SEQ ED NO: 1875, SEQ ID NO: 1876, SEQ ED NO: 1877, SEQ ID NO: 1878, 

jQ 10 SEQEDNO:1880, SEQ EDNO:1881, SEQ ID NO:1882, SEQ ID NO:1884, SEQ ED 

m NO: 1 885, SEQ ID NO: 1 886, SEQ ID NO : 1 887, SEQ ID NO: 1 889, SEQ ID NO: 1 890, 

r: 

i— 3. 

P SEQ ID NO: 1891, SEQ ID NO: 1892, SEQ ID NO: 1893, SEQ ID NO: 1894, SEQ ID 

P NO: 1895, SEQ ID NO: 1896, SEQ ED NO: 1898, SEQ ID NO: 1899, SEQ ID NO: 1900, 

O 

H SEQ ED NO:1901, SEQ ED NO:1903, SEQ ED NO:1904, SEQ ED NO:1905, SEQ ED 

15 NO: 1906, SEQ ED NO: 1907, SEQ ED NO:1908, SEQ ED NO: 1909, SEQ ED NO: 1910, 
SEQ ED NO:1911, SEQ ED NO:1912, SEQ ED NO: 1913, SEQ EDNO:1914, SEQ ED 
NO:1916, SEQ ED NO:1917, SEQ ED NO:1918, SEQ ED NO:1919, SEQ ED NO:1921, 
SEQ ED NO: 1922, SEQ ED NO: 1923, SEQ ED NO: 1924, SEQ ED NO: 1926, SEQ ED 
NO:1927, SEQ ED NO:1928, SEQ ED NO:1929, and/or SEQ ED NO:1931, and/or a 
20 nucleic acid molecule of Table I, Table II, Table EH or Table IV, as well as allelic variants 
of nucleic acid molecules having these nucleic acid sequences and homologues of nucleic 
acid molecules having these nucleic acid sequences; preferably such a homologue 
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encodes or is complementary to a nucleic acid molecule that encodes at least one epitope 
that elicits an immune response against a protein having an amino acid sequence SEQ ID 
N0:2, SEQ ID N0:8, SEQ ID NO: 14, SEQ ID NO:20, SEQ ID NO:26 5 SEQ ID NO:32, 
SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO:154, SEQ ID NO:160, SEQ ID NO:163, 

5 SEQ ID NO: 169, SEQ ID NO:1862 5 SEQ ID NO: 1868, SEQ ID NO: 1873, SEQ ID 
NO:1879, SEQ ID NO:1883, SEQ ID NO:1888, SEQ ID NO:1897, SEQ ID NO:1902, 
SEQ ID NO:1915, SEQ ID NO:1920, SEQ ID NO:1925, and/or SEQ ID NO:1930. Such 
nucleic acid molecules can include nucleotides in addition to those included in the SEQ 
ID NOs, such as, but not limited to, a full-length gene, a full-length coding region, a 

1 0 nucleic acid molecule encoding a fusion protein, or a nucleic acid molecule encoding a 
multivalent protective compound. 

In one embodiment, HMT and/or HNC nucleic acid molecule of the present 
invention encodes a protein that is at least about 70%, preferably at least about 75%, more 
preferably at least about 80%, even more preferably at least about 85%, even more 

1 5 preferably at least about 90%, even more preferably at least about 95%, even more 

preferably at least about 98%, and even more preferably at least about 100% identical to 
PCfALN 384 , PCfCBP 272 , PCfNKAB 326 , PCfLGIC 569 , PCfANON 398 , PCfMALV 254 , 
PCfOSD 135 , PCfNMDA 246 , PCfCLBPlA 147 , PCfCLBP2A l4V , PCfLGIC 672 , PCfNAH 608 , 
PCfCLIC 262 , PCfPL2 391 , PCfPL3 8l , PCfPL4 285 , PCfSVP 530 , PCfVGCC 851 , PCfAUP 102 , 

20 PCf7B2 267 , PCf7B2 247 and/or a protein encoded by a nucleic acid molecule having a 
sequence of Table I, Table n, Table m and/or Table IV. 
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In one embodiment, a HMT and/or HNC nucleic acid molecule of the present 
invention encodes a protein having an amino acid sequence that is at least about 70%, 
preferably at least about 75%, more preferably at least about 80%, even more preferably 
at least about 85%, even more preferably at least about 90%, even more preferably at least 

5 about 95%, even more preferably at least about 98%, and even more preferably at least 
about 100% identical to SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:14, SEQ ID NO:20, 
SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO: 154, SEQ 
ID NO: 160, SEQ ID NO: 163, SEQ ID NO: 169, SEQ ID NO: 1862, SEQ IDNO:1868, 
SEQ ID NO: 1873, SEQ ID NO: 1879, SEQ ID NO: 1883, SEQ ID NO: 1888, SEQ ID 

10 NO:1897, SEQ ID NO:1902, SEQ ID NO:1915, SEQ ID NO:1920, SEQ ID NO:1925, 
and/or SEQ ID NO: 1930 and/or a protein encoded by a nucleic acid molecule having a 
sequence of Table I, Table II, Table HI and/or Table IV. The present invention also 
includes a HMT and/or HNC nucleic acid molecule encoding a protein having at least a 
portion of SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:14, SEQ ED NO:20, SEQ ID 

15 NO:26, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:44, SEQ ID NO: 154, SEQ ID 

NO: 160, SEQ ID NO: 163, SEQ ID NO: 169, SEQ ID NO: 1862, SEQ ID NO: 1868, SEQ 
ID NO: 1873, SEQ ID NO: 1879, SEQ ID NO: 1883, SEQ ID NO: 1888, SEQ ID NO: 1897, 
SEQ ID NO: 1902, SEQ ID NO: 19 15, SEQ ID NO: 1920, SEQ ID NO: 1925, and/or SEQ 
ID NO: 1930 and/or a protein encoded by a nucleic acid molecule having a sequence of 

20 Table I, Table II, Table m and/or Table TV, as well as allelic variants of a nucleic acid 
molecule encoding a protein having these sequences, including nucleic acid molecules 
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that have been modified to accommodate codon usage properties of the cells in which 
such nucleic acid molecules are to be expressed. 

In another embodiment, a preferred flea HMT and/or HNC nucleic acid molecule 
of the present invention comprises a nucleic acid molecule comprising at least about 15 

5 nucleotides, more preferably at least about 1 8 nucleotides, more preferably at least about 
20 nucleotides, more preferably at least about 25 nucleotides, more preferably at least 
about 30 nucleotides, more preferably at least about 40 nucleotides, more preferably at 
least about 50 nucleotides, more preferably at least about 100 nucleotides, more 
preferably at least about 150 nucleotides, more preferably at least about 350 nucleotides, 

10 more preferably at least about 450 nucleotides, more preferably at least about 550 

nucleotides, more preferably at least about 650 nucleotides, more preferably at least about 
750 nucleotides, more preferably at least about 1000 nucleotides, more preferably at least 
about 1500 nucleotides, more preferably at least about 1750 nucleotides more preferably 
at least about 2000 nucleotides, and even more preferably at least about 2250 nucleotides 

15 in length. 

In another embodiment, a preferred flea HMT and/or HNC nucleic acid molecule 
encodes a protein comprising at least about 5 amino acids, preferably at least about 6 
amino acids, more preferably at least about 10 amino acids, more preferably at least about 
15 amino acids, more preferably at least about 20 amino acids, more preferably at least 
20 about 25 amino acids, more preferably at least about 30 amino acids, more preferably at 
least about 40 amino acids, more preferably at least about 50 amino acids, more 
preferably at least about 100 amino acids, more preferably at least about 150 amino acids, 
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more preferably at least about 200 amino acids, more preferably at least about 300 amino 
acids, more preferably at least about 400 amino acids, more preferably at least about 500 
amino acids, even more preferably at least about 560 amino acids in length. 

In another embodiment, a preferred flea HMT and/or HNC nucleic acid molecule 

5 of the present invention comprises an apparently full-length HMT and/or HNC coding 
region, i.e., the preferred nucleic acid molecule encodes an apparently full-length HMT 
and/or HNC protein, or a post-translationally modified protein thereof. In one 
embodiment, a preferred HMT and/or HNC nucleic acid molecule of the present 
invention encodes a mature protein. 

10 In another embodiment, a preferred flea HMT and/or HNC nucleic acid molecule 

of the present invention comprises a nucleic acid molecule comprising SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID 

15 NO:25, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
NO:40 5 SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 158, SEQ ID 
NO: 159, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID 

20 NO: 167, SEQ ID NO: 168, SEQ ID NO: 170, SEQ ID NO: 1859, SEQ ID NO: 1860, SEQ 
ID NO:1861, SEQ ID NO:1863, SEQ ID NO:1864, SEQ ID NO:1866, SEQ ID NO:1867, 
SEQ ID NO:1869, SEQ ID NO:1870, SEQ ID NO:1871, SEQ ID NO:1872, SEQ ID 
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NO: 1 874, SEQ ID NO: 1 875 , SEQ ID NO: 1 876, SEQ ID NO: 1 877, SEQ ID NO: 1 878, 
SEQ ID NO: 1880, SEQ ID NO: 1881, SEQ ID NO:1882, SEQ ID NO: 1884, SEQ ID 
NO: 1885, SEQ ID NO: 1886, SEQ ID NO: 1887, SEQ ID NO: 1889, SEQ ID NO: 1890, 
SEQ ID NO:1891, SEQ ID NO:1892, SEQ ID N0.1893, SEQ ID NO:1894, SEQ ID 
NO: 1 895, SEQ ID NO: 1 896, SEQ ID NO: 1898, SEQ ID NO: 1 899, SEQ ID NO: 1 900, 
SEQ ID NO: 1901, SEQ ID NO: 1903, SEQ ID NO: 1904, SEQ ID NO: 1905, SEQ ID 
NO: 1906, SEQ ID NO: 1907, SEQ ID NO: 1908, SEQ ID NO: 1909, SEQ ID NO: 19 10, 
SEQ ID NO:191 1, SEQ ID NO:1912, SEQ ID NO:1913, SEQ ID NO:1914, SEQ ID 
NO:1916, SEQEDNO:1917, SEQ ID NO: 1918, SEQ IDNO:1919, SEQ IDNO:1921, 
SEQ ID NO: 1922, SEQ ID NO: 1923, SEQ ID NO: 1924, SEQ ID NO: 1926, SEQ ID 
NO:1927, SEQ ID NO:1928, SEQ ID NO: 1929, and/or SEQ ID NO:1931. 

Knowing the nucleic acid sequences of certain flea HMT and/or HNC nucleic acid 
molecules of the present invention allows one skilled in the art to, for example, (a) make 
copies of those nucleic acid molecules, (b) obtain nucleic acid molecules including at 
least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including full- 
length genes, full-length coding regions, regulatory control sequences, truncated coding 
regions), and (c) obtain other flea HMT and/or HNC nucleic acid molecules. Such 
nucleic acid molecules can be obtained in a variety of ways including screening 
appropriate expression libraries with antibodies of the present invention; traditional 
cloning techniques using oligonucleotide probes of the present invention to screen 
appropriate libraries; and PCR amplification of appropriate libraries or DNA using 
oligonucleotide primers of the present invention. Preferred libraries to screen or from 
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which to amplify nucleic acid molecules include flea 1 st instar larvae; 3 rd instar larvae, 
wandering larvae, prepupal larvae, pupae and whole adult flea cDNA libraries as well as 
genomic DNA libraries. Similarly, preferred DNA sources to screen or from which to 
amplify nucleic acid molecules include flea prepupal cDNA, adult cDNA and genomic 
DNA. Techniques to clone and amplify genes are disclosed, for example, in Sambrook et 
al. 5 ibid. 

The present invention also includes nucleic acid molecules that are 
oligonucleotides capable of hybridizing, under stringent hybridization conditions, with 
complementary regions of other, preferably longer, nucleic acid molecules of the present 
invention such as those comprising G felis HMT and/or HNC nucleic acid molecules or 
other flea HMT and/or HNC nucleic acid molecules. Oligonucleotides of the present 
invention can be RNA, DNA, or derivatives of either. The minimum size of such 
oligonucleotides is the size required for formation of a stable hybrid between an 
oligonucleotide and a complementary sequence on a nucleic acid molecule of the present 
invention. A preferred oligonucleotide of the present invention has a maximum size of 
preferably about 100 to 200 nucleotides. The present invention includes oligonucleotides 
that can be used as, for example, probes to identify nucleic acid molecules, primers to 
produce nucleic acid molecules, or therapeutic reagents to inhibit flea HMT and/or HNC 
protein production or activity (e.g., as antisense-, triplex formation-, ribozyme- and/or 
RNA drug-based reagents). The present invention also includes the use of such 
oligonucleotides to protect animals from disease using one or more of such technologies. 
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Appropriate oligonucleotide-containing therapeutic compositions can be administered to 
an animal using techniques known to those skilled in the art. 

One embodiment of the present invention includes a recombinant vector, which 
includes at least one isolated nucleic acid molecule of the present invention, inserted into 
any vector capable of delivering the nucleic acid molecule into a host cell. Such a vector 
contains heterologous nucleic acid sequences, that is nucleic acid sequences that are not 
naturally found adjacent to nucleic acid molecules of the present invention and that 
preferably are derived from a species other than the species from which the nucleic acid 
molecule(s) are derived. The vector can be either RNA or DNA, either prokaryotic or 
eukaryotic, and typically is a virus or a plasmid. Recombinant vectors can be used in the 
cloning, sequencing, and/or otherwise manipulating of flea HMT and/or HNC nucleic 
acid molecules of the present invention. 

One type of recombinant vector, referred to herein as a recombinant molecule, 
comprises a nucleic acid molecule of the present invention operatively linked to an 
expression vector. The phrase operatively linked refers to insertion of a nucleic acid 
molecule into an expression vector in a manner such that the molecule is able to be 
expressed when transformed into a host cell. As used herein, an expression vector is a 
DNA or RNA vector that is capable of transforming a host cell and of effecting 
expression of a specified nucleic acid molecule. Preferably, the expression vector is also 
capable of replicating within the host cell. Expression vectors can be either prokaryotic 
or eukaryotic, and are typically viruses or plasmids. Expression vectors of the present 
invention include any vectors that function (i.e., direct gene expression) in recombinant 
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cells of the present invention, including in bacterial, fungal, parasite, insect, other animal, 
and plant cells. Preferred expression vectors of the present invention can direct gene 
expression in bacterial, yeast, insect and mammalian cells, and more preferably in the cell 
types disclosed herein. 

In particular, expression vectors of the present invention contain regulatory 
sequences such as transcription control sequences, translation control sequences, origins 
of replication, and other regulatory sequences that are compatible with the recombinant 
cell and that control the expression of nucleic acid molecules of the present invention. In 
particular, recombinant molecules of the present invention include transcription control 
sequences. Transcription control sequences are sequences that control the initiation, 
elongation, and termination of transcription. Particularly important transcription control 
sequences are those which control transcription initiation, such as promoter, enhancer, 
operator and repressor sequences. Suitable transcription control sequences include any 
transcription control sequence that can function in at least one of the recombinant cells of 
the present invention. A variety of such transcription control sequences are known to 
those skilled in the art. Preferred transcription control sequences include those that 
function in bacterial, yeast, or insect and mammalian cells, such as, but not limited to, 
tac, lac, trp, trc, oxy-pro, omp/lpp, rrnB, bacteriophage lambda (such as lambda p L and 
lambda p R and fusions that include such promoters), bacteriophage T7, Tllac, 
bacteriophage T3, bacteriophage SP6, bacteriophage SP01, metallothionein, alpha-mating 
factor, Pichia alcohol oxidase, alphavirus subgenomic promoter, antibiotic resistance 
gene, baculovirus, Heliothis zea insect virus, vaccinia virus, herpesvirus, raccoon 
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poxvirus, other poxvirus, adenovirus, cytomegalovirus (such as immediate early 
promoter), simian virus 40, retrovirus, actin, retroviral long terminal repeat, Rous 
sarcoma virus, heat shock, phosphate and nitrate transcription control sequences as well 
as other sequences capable of controlling gene expression in prokaryotic or eukaryotic 

5 cells. Additional suitable transcription control sequences include tissue-specific 

promoters and enhancers as well as lymphokine-inducible promoters (e.g., promoters 
inducible by interferons or interleukins). Transcription control sequences of the present 
invention can also include naturally occurring transcription control sequences naturally 
associated with fleas, such as C.felis transcription control sequences. 

10 Suitable and preferred nucleic acid molecules to include in recombinant vectors of 

the present invention are as disclosed herein. Preferred nucleic acid molecules to include 
in recombinant vectors, and particularly in recombinant molecules, include nucleic acid 
molecules having a sequence of Table I, Table H, Table III and/or Table IV. Particularly 
preferred nucleic acid molecules to include in recombinant vectors, and particularly in 

15 recombinant molecules, include nCfALN 2057 , nCfALN 1152 , nCfCBP 1128 , nCfCBP 816 , 
nCfNKAB l7I4 , nCfNKAB 978 nCfLGIC 2240 , nCfLGIC 1707 , nCfANON 1429 , nCfANON 1194 , 
nCfMALV 765 , nCfMALV 762 , nCfOSD 604 , nCfOSD 405 , nCfNMDA 1227 , nCfNMDA 738 , 
nCfCLBPlA 633 , nCfCLBPlA 441 , nCfCLBP2A 631 , nCfCLBP2A 441 ,nCfLGIC 2739 , 
nCfLGIC 2016 , nCfNAH 2080 , nCfNAH 1824 , nCfCLIC 2283 , nCfCLIC 786 , nCfPL2 1291 , 

20 nCfPL2 U73 , nCfPL3 406 , nCfPL3 243 , nCfPL4 974 , nCfPL4 1043 , nCfPL4 1062 , nCfPL4 855 , 
nCfSVP 1875 , nCfSVP 1590 , nCfVGCC 381 , nCfVGCC 2191 , nCfVGCC 1968 , nCfVGCC 673 , 
nCfVGCC 3126 , nCfVGCC 2553 , nCfAUP U81 , nCfAUP 306 , nCf7B2 2I6! , nCf7B2 801 , nCf7B2 741 . 
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Recombinant molecules of the present invention may also (a) contain secretory 
signals (i.e., signal segment nucleic acid sequences) to enable an expressed flea protein of 
the present invention to be secreted from the cell that produces the protein and/or (b) 
contain fusion sequences which lead to the expression of nucleic acid molecules of the 

5 present invention as fusion proteins. Examples of suitable signal segments include any 
signal segment capable of directing the secretion of a protein of the present invention. 
Preferred signal segments include, but are not limited to, tissue plasminogen activator (t- 
PA), interferon, interleukin, growth hormone, histocompatibility and viral envelope 
glycoprotein signal segments. Suitable fusion segments encoded by fusion segment 

10 nucleic acids are disclosed herein. In addition, a nucleic acid molecule of the present 
invention can be joined to a fusion segment that directs the encoded protein to the 
proteosome, such as a ubiquitin fusion segment. Eukaryotic recombinant molecules may 
also include intervening and/or untranslated sequences surrounding and/or within the 
nucleic acid sequences of nucleic acid molecules of the present invention. 

1 5 Another embodiment of the present invention includes a recombinant cell 

comprising a host cell transformed with one or more recombinant molecules of the 
present invention. Transformation of a nucleic acid molecule into a cell can be 
accomplished by any method by which a nucleic acid molecule can be inserted into the 
cell. Transformation techniques include, but are not limited to, transfection, 

20 electroporation, microinjection, lipofection, adsorption, and protoplast fusion. A 
recombinant cell may remain unicellular or may grow into a tissue, organ or a 
multicellular organism. It is to be noted that a cell line refers to any recombinant cell of 
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the present invention that is not a transgenic animal. Transformed nucleic acid molecules 
of the present invention can remain extrachromosomal or can integrate into one or more 
sites within a chromosome of the transformed (i.e., recombinant) cell in such a manner 
that their ability to be expressed is retained. Preferred nucleic acid molecules with which 
5 to transform a cell include C.felis HMT and HNC nucleic acid molecules disclosed 
herein. Preferred nucleic acid molecules with which to transform a cell include nucleic 
acid molecules having a sequence of Table I, Table II, Table HI and/or Table IV. 
Particularly preferred nucleic acid molecules with which to transform a cell include 
nCfALN 2057 , nCfALN 11523 nCfCBP n28 , nCfCBP 816 , nC£NKAB 17I4 , nCfNKAB 978 
1 0 nCfLGIC 2240 , nC£LGIC 17073 nCfANON 14293 nCfANON t 194 , nCfMALV 765 , nCfMALV 762 , 
nCfOSD 604 , nCfOSD 405 , nCfNMDA 1227 , nCfNMDA 7383 nCfCLBPlA 633 , nCfCLBPlA 441 , 
nCfCLBP2A 63l3 nCfCLBP2A 441 , nCfLGIC 2739 , nCfLGIC 2016 , nCfNAH 2080 , nCfNAH 1824 , 
H nCfCLIC 2283 , nCfCLIC 7g6 , nCfPL2 1291 , nCfPL2 1173 , nCfPL3 406 , nCfPL3 243 , nCfPL4 9743 

^ nCfPL4 1043 , nCfPL4 1062 , nCfPL4 855 , nCfSVP 1875 , nCfSVP 1590 , nCfVGCC 381 , nCfVGCC 2191 , 

15 nCfVGCC 19683 nCfVGCC 673J nCfVGCC 31263 nCfVGCC 25533 nCfAUP 118l5 nCfAUP 3063 
nCf7B2 2l6l3 nCf7B2 80l3 or nCf7B2 741 . 

Suitable host cells to transform include any cell that can be transformed with a 
nucleic acid molecule of the present invention. Host cells can be either untransformed 
cells or cells that are already transformed with at least one nucleic acid molecule (e.g., 
20 nucleic acid molecules encoding one or more proteins of the present invention and/or 

other proteins useful in the production of multivalent vaccines). Host cells of the present 
invention either can be endogenously (i.e., naturally) capable of producing flea HMT 
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and/or HNC proteins of the present invention or can be capable of producing such 
proteins after being transformed with at least one nucleic acid molecule of the present 
invention. Host cells of the present invention can be any cell capable of producing at 
least one protein of the present invention, and include bacterial, fungal (including yeast), 
parasite (including helminth, protozoa and ectoparasite), other insect, other animal and 
plant cells. Preferred host cells include bacterial, mycobacterial, yeast, insect and 
mammalian ceils. More preferred host cells include Salmonella, Escherichia, Bacillus, 
Caulobacter, Listeria, Saccharomyces, Pichia, Spodoptera, Mycobacteria, Trichoplusia, 
BHK (baby hamster kidney) cells, MDCK cells (Madin-Darby canine kidney cell line), 
CRFK cells (Crandell feline kidney cell line), CV-1 cells (African monkey kidney cell 
line used, for example, to culture raccoon poxvirus), COS (e.g., COS-7) cells, and Vero 
cells. Particularly preferred host cells are Escherichia coli, including E. coli K-12 
derivatives; Salmonella typhi) Salmonella typhimurium, including attenuated strains such 
as UK-1 x 3987 and SR-1 1 x 4072; Caulobacter, Pichia; Spodoptera frugiperda; 
Trichoplusia ni\ BHK cells; MDCK cells; CRFK cells; CV-1 cells; COS cells; Vero cells; 
and non-tumorigenic mouse myoblast G8 cells (e.g., ATCC CRL 1246). Additional 
appropriate mammalian cell hosts include other kidney cell lines, other fibroblast cell 
lines (e.g., human, murine or chicken embryo fibroblast cell lines), myeloma cell lines, 
Chinese hamster ovary cells, mouse NIH/3T3 cells, LMTK 31 cells and/or HeLa cells. In 
one embodiment, the proteins may be expressed as heterologous proteins in myeloma cell 
lines employing immunoglobulin promoters. 
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A recombinant cell is preferably produced by transforming a host cell with one or 
more recombinant molecules, each comprising one or more nucleic acid molecules of the 
present invention operatively linked to an expression vector containing one or more 
transcription control sequences, examples of which are disclosed herein. The phrase 
operatively linked refers to insertion of a nucleic acid molecule into an expression vector 
in a manner such that the molecule is able to be expressed when transformed into a host 
cell. 

A recombinant cell of the present invention includes any cell transformed with at 
least one of any nucleic acid molecule of the present invention. Suitable and preferred 
nucleic acid molecules as well as suitable and preferred recombinant molecules with 
which to transfer cells are disclosed herein. 

Recombinant cells of the present invention can also be co-transformed with one or 
more recombinant molecules including flea HMT and/or HNC nucleic acid molecules 
encoding one or more proteins of the present invention and one or more other nucleic acid 
molecules encoding other protective compounds, as disclosed herein (e.g., to produce 
multivalent vaccines). 

Recombinant DNA technologies can be used to improve expression of 
transformed nucleic acid molecules by manipulating, for example, the number of copies 
of the nucleic acid molecules within a host cell, the efficiency with which those nucleic 
acid molecules are transcribed, the efficiency with which the resultant transcripts are 
translated, and the efficiency of post-translational modifications. Recombinant 
techniques useful for increasing the expression of nucleic acid molecules of the present 



-133- 



FC-6-C1 

invention include, but are not limited to, operatively linking nucleic acid molecules to 
high-copy number plasmids, integration of the nucleic acid molecules into one or more 
host cell chromosomes, addition of vector stability sequences to plasmids, substitutions or 
modifications of transcription control signals (e.g., promoters, operators, enhancers), 
substitutions or modifications of translational control signals (e.g., ribosome binding 
sites, Shine-Dalgarno sequences), modification of nucleic acid molecules of the present 
invention to correspond to the codon usage of the host cell, deletion of sequences that 
destabilize transcripts, and use of control signals that temporally separate recombinant 
cell growth from recombinant enzyme production during fermentation. The activity of an 
expressed recombinant protein of the present invention may be improved by fragmenting, 
modifying, or derivatizing nucleic acid molecules encoding such a protein. 

Isolated flea HMT and/or HNC proteins of the present invention can be produced 
in a variety of ways, including production and recovery of natural proteins, production 
and recovery of recombinant proteins, and chemical synthesis of the proteins. In one 
embodiment, an isolated protein of the present invention is produced by culturing a cell 
capable of expressing the protein under conditions effective to produce the protein, and 
recovering the protein. A preferred cell to culture is a recombinant cell of the present 
invention. Effective culture conditions include, but are not limited to, effective media, 
bioreactor, temperature, pH and oxygen conditions that permit protein production. An 
effective, medium refers to any medium in which a cell is cultured to produce a flea HMT 
and/or HNC protein of the present invention. Such medium typically comprises an 
aqueous medium having assimilable carbon, nitrogen and phosphate sources, and 
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appropriate salts, minerals, metals and other nutrients, such as vitamins. Cells of the 
present invention can be cultured in conventional fermentation bioreactors, shake flasks, 
test tubes, microtiter dishes, and petri plates. Culturing can be carried out at a 
temperature, pH and oxygen content appropriate for a recombinant cell Such culturing 

5 conditions are within the expertise of one of ordinary skill in the art. Examples of 
suitable conditions are included in the Examples section. 

Depending on the vector and host system used for production, resultant proteins of 
the present invention may either remain within the recombinant cell; be secreted into the 
fermentation medium; be secreted into a space between two cellular membranes, such as 

10 the periplasmic space in E. coli; or be retained on the outer surface of a cell or viral 
membrane. 

The phrase "recovering the protein", as well as similar phrases, refers to collecting 
the whole fermentation medium containing the protein and need not imply additional 
steps of separation or purification. Proteins of the present invention can be purified using 

15 a variety of standard protein purification techniques, such as, but not limited to, affinity 
chromatography, ion exchange chromatography, filtration, electrophoresis, hydrophobic 
interaction chromatography, gel filtration chromatography, reverse phase 
chromatography, concanavalin A chromatography, chromatofocusing and differential 
solubilization. Proteins of the present invention are preferably retrieved in "substantially 

20 pure" form. As used herein, "substantially pure" refers to a purity that allows for the 
effective use of the protein as a therapeutic composition or diagnostic. A therapeutic 
composition for animals, for example, should exhibit no substantial toxicity and 
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preferably should be capable of stimulating the production of antibodies in a treated 
animal. 

The present invention also includes isolated (i.e., removed from their natural 
milieu) antibodies that selectively bind to a flea HMT and/or HNC protein of the present 
invention or a mimetope thereof (e.g., mti-C.felis HMT or HNC antibodies). As used 
herein, the term "selectively binds to" an HMT and/or HNC protein refers to the ability of 
antibodies of the present invention to preferentially bind to specified proteins and 
mimetopes thereof of the present invention. Binding can be measured using a variety of 
methods standard in the art including enzyme immunoassays (e.g., ELISA), irnmunoblot 
assays, etc.; see, for example, Sambrook et al, ibid., and Harlow, et al., 1988, Antibodies, 
a Laboratory Manual, Cold Spring Harbor Labs Press; Harlow et al., ibid., is 
incorporated by reference herein in its entirety. An anti-HMT or anti-HNC antibody of 
the present invention preferably selectively binds to a flea HMT or HNC protein 
respectively in such a way as to inhibit the function of that protein. 

Isolated antibodies of the present invention can include antibodies in serum, or 
antibodies that have been purified to varying degrees. Antibodies of the present invention 
can be polyclonal or monoclonal, or can be functional equivalents such as antibody 
fragments and genetically-engineered antibodies, including single chain antibodies or 
chimeric antibodies that can bind to one or more epitopes. 

A preferred method to produce antibodies of the present invention includes 
(a) administering to an animal an effective amount of a protein, peptide or mimetope 
thereof of the present invention to produce the antibodies and (b) recovering the 
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antibodies. In another method, antibodies of the present invention are produced 
recombinant^ using techniques as heretofore disclosed to produce HMT and/or HNC 
proteins of the present invention. Antibodies raised against defined proteins or 
mimetopes can be advantageous because such antibodies are not substantially 
contaminated with antibodies against other substances that might otherwise cause 
interference in a diagnostic assay or side effects if used in a therapeutic composition. 

Antibodies of the present invention have a variety of potential uses that are within 
the scope of the present invention. For example, such antibodies can be used (a) as 
therapeutic compounds to passively immunize an animal in order to protect the animal 
from fleas susceptible to treatment by such antibodies and/or (b) as tools to screen 
expression libraries and/or to recover desired proteins of the present invention from a 
mixture of proteins and other contaminants. Furthermore, antibodies of the present 
invention can be used to target cytotoxic agents to fleas in order to directly kill such fleas. 
Targeting can be accomplished by conjugating (i.e., stably joining) such antibodies to the 
cytotoxic agents using techniques known to those skilled in the art. Suitable cytotoxic 
agents are known to those skilled in the art. 

One embodiment of the present invention is a therapeutic composition that, when 
administered to an animal susceptible to flea infestation, is capable of protecting that 
animal from flea infestation. Therapeutic compositions of the present invention include 
at least one of the following protective molecules: an isolated flea HMT and/or HNC 
protein; a mimetope of an isolated flea HMT and/or HNC protein; an isolated flea HMT 
and/or HNC nucleic acid molecule; and/or a compound derived from said isolated flea 
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HMT and/or HNC protein that inhibits HMT and/or HNC protein activity. A therapeutic 
composition of the present invention can further comprise a component selected from the 
group of an excipient, a carrier, and/or an adjuvant; these components are described 
further herein. As used herein, a protective molecule or protective compound refers to a 

5 compound that, when administered to an animal in an effective manner, is able to treat, 
ameliorate, and/or prevent flea infestation. Preferred fleas to target are heretofore 
disclosed. One example of a protective molecule is a vaccine, such as, but not limited to, 
a naked nucleic acid vaccine, a recombinant virus vaccine, a recombinant cell vaccine, 
and a recombinant protein vaccine. Another example of a protective molecule is a 

10 compound that inhibits HMT and/or HNC protein activity, such as an isolated antibody 
that selectively binds to a flea HMT and/or HNC protein, a substrate analog of a flea 
HMT and/or HNC protein, anti-sense-, triplex formation-, ribozyme-, and/or RNA drug- 
based compounds, or other inorganic or organic molecules that inhibit HMT and/or HNC 
protein activity. Inhibiting flea HMT and/or HNC protein activity can refer to the ability 

15 of a compound to reduce the activity of flea HMT and/or HNC proteins. Inhibiting flea 
HMT and/or HNC protein activity can also refer to the ability of a compound to reduce 
the amount of flea HMT and/or HNC protein in a flea. 

Another embodiment of the present invention includes a method to reduce a flea 
infestation in an animal susceptible to flea infestation. Such a method includes the step 

20 of administering to the animal a therapeutic molecule comprising a protective compound 
selected from the group consisting of (a) an isolated flea HMT and/or HNC protein; (b) a 
mimetope of an isolated flea HMT and/or HNC protein; (c) an isolated flea HMT and/or 
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HNC nucleic acid molecule; and (d) a compound derived from an isolated flea HMT 
and/or HNC protein that inhibits HMT and/or HNC protein activity. 

Therapeutic compositions of the present invention can be administered to any 
animal susceptible to flea infestation, preferably to mammals, and more preferably to 
dogs, cats, humans, ferrets, horses, cattle, sheep, and other pets, economic food animals, 
work animals and/or zoo animals. Preferred animals to protect against flea infestation 
include dogs, cats, humans, and ferrets, with dogs and cats being particularly preferred. 

As used herein, the term derived, or the term derived from, refers to a peptide, 
antibody, mimetope, nucleic acid molecule, or other compound that was obtained from a 
flea HMT and/or HNC protein or nucleic acid molecule of the present invention. 
Methods to obtain derivatives from a HMT and/or HNC molecule of the present 
invention are known in the art, and as such include, but are not limited to molecular 
modeling of HMT and/or HNC proteins to determine active sites, i.e. sites that interact 
with other molecules, and predicting from these active sites smaller fragments and/or 
mimetopes that retain and/or mimic these active sites, thereby inhibiting HMT and/or 
HNC protein activity; screening of peptide or small chemical compound libraries against 
HMT and/or HNC proteins of the present invention; and screening of polyclonal or 
monoclonal antibodies to find antibodies that specifically bind HMT and/or HNC 
proteins of the present invention. 

A HMT and/or HNC protein inhibitor of the present invention is identified by its 
ability to bind to, modify, or otherwise interact with, a flea HMT and/or HNC protein, 
thereby inhibiting the activity of HMT and/or HNC proteins. Suitable inhibitors of HMT 
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and/or HNC protein activity are compounds that inhibit HMT and/or HNC protein 
activity in at least one of a variety of ways: (a) by binding to or otherwise interacting 
with or otherwise modifying HMT and/or HNC protein sites; (b) by binding to or 
otherwise interacting with or otherwise modifying the HMT and/or HNC protein active 
site; (c) by binding to the HMT and/or HNC protein and thus reducing the availability of 
the HMT and/or HNC protein in solution; and (d) by interacting with other regions of the 
HMT and/or HNC protein to inhibit HMT and/or HNC protein activity, for example, by 
allosteric interaction. 

Flea HMT and/or HNC protein inhibitors can be used directly as compounds in 
compositions of the present invention to treat animals as long as such compounds are not 
harmful to host animals being treated. Preferred HMT and/or HNC protein inhibitors of 
the present invention include, but are not limited to, flea HMT and/or HNC protein 
substrate analogs, and other molecules that bind to a flea HMT and/or HNC proteins (e.g., 
to an allosteric site) in such a manner that the activity of the flea HMT and/or HNC 
protein is inhibited. A HMT and/or HNC protein substrate analog refers to a compound 
that interacts with (e.g., binds to, associates with, modifies) the active site of a HMT 
and/or HNC protein. A preferred HMT and/or HNC protein substrate analog inhibits 
HMT and/or HNC protein activity. HMT and/or HNC protein substrate analogs can be of 
any inorganic or organic composition. HMT and/or HNC protein substrate analogs can 
be, but need not be, structurally similar to a HMT and/or HNC protein natural substrate as 
long as they can interact with the active site of that HMT and/or HNC protein. HMT 
and/or HNC protein substrate analogs can be designed using computer-generated 
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structures of HMT and/or HNC proteins of the present invention or computer structures 
of HMT and/or HNC protein's natural substrates. Preferred sites to model include one or 
more of the active sites of HMT and/or HNC protein. Substrate analogs can also be 
obtained by generating random samples of molecules, such as oligonucleotides, peptides, 

5 peptidomimetic compounds, or other inorganic or organic molecules, and screening such 
samples for their ability to interfere with interaction between HMT and/or HNC proteins 
and their substrates, e.g. by affinity chromatography techniques. A preferred HMT 
and/or HNC protein substrate analog is a HMT and/or HNC protein mimetic compound, 
i.e., a compound that is structurally and/or functionally similar to a natural substrate of a 

10 HMT and/or HNC protein of the present invention, particularly to the region of the 
substrate that interacts with the HMT and/or HNC protein active site, but that inhibits 
HMT and/or HNC protein activity upon interacting with the HMT and/or HNC protein 
active site. 

The present invention also includes a therapeutic composition comprising at least 
15 one protective molecule of the present invention in combination with at least one 
additional compound protective against one or more infectious agents. 

In one embodiment, a therapeutic composition of the present invention can be 
used to protect an animal from flea infestation by administering such composition to a 
flea in order to prevent infestation. Such administration to the flea and/or animal could 
20 be oral, or by application to the animal's body surface (e.g. topical spot-on, or spraying 
onto the animal), or by application to the environment (e.g., spraying). Examples of such 
compositions include, but are not limited to, transgenic vectors capable of producing at 
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least one therapeutic composition of the present invention. In another embodiment a flea 
can ingest therapeutic compositions, or products thereof, present on the surface of or in 
the blood of a host animal that has been administered a therapeutic composition of the 
present invention. 

In accordance with the present invention, a host animal (i.e., an animal that is or is 
capable of being infested with fleas) is treated by administering to the animal a 
therapeutic composition of the present invention in such a manner that the composition 
itself (e.g., a HMT and/or HNC protein inhibitor, a HMT and/or HNC protein synthesis 
suppressor (i.e., a compound that decreases the production or half-life of a HMT and/or 
HNC protein in fleas), a HMT and/or HNC protein mimetope, or a anti-HMT and/or 
HNC antibody) or a product generated by the animal in response to administration of the 
composition (e.g., antibodies produced in response to administration of a flea HMT 
and/or HNC protein or nucleic acid molecule, or conversion of an inactive inhibitor 
"prodrug" to an active HMT and/or HNC protein inhibitor) ultimately enters the flea. A 
host animal is preferably treated in such a way that the compound or product thereof is 
present on the body surface of the animal or enters the blood stream of the animal. Fleas 
are then exposed to the composition or product when they feed from the animal. For 
example, flea HMT and/or HNC protein inhibitors administered to an animal are 
administered in such a way that the inhibitors enter the blood stream of the animal, where 
they can be taken up by feeding fleas. 

The present invention also includes the ability to reduce larval flea infestation in 
that when fleas feed from a host animal that has been administered a therapeutic 
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composition of the present invention, at least a portion of compounds of the present 
invention, or products thereof, in the blood taken up by the fleas are excreted by the fleas 
in feces, which is subsequently ingested by flea larvae. In particular, it is of note that flea 
larvae obtain most, if not all, of their nutrition from flea feces. 

In accordance with the present invention, reducing HMT and/or HNC protein 
activity in a flea can lead to a number of outcomes that reduce flea burden on treated 
animals and their surrounding environments. Such outcomes include, but are not limited 
to, (a) reducing the viability of fleas that feed from the treated animal, (b) reducing the 
fecundity of female fleas that feed from the treated animal, (c) reducing the reproductive 
capacity of male fleas that feed from the treated animal, (d) reducing the viability of eggs 
laid by female fleas that feed from the treated animal, (e) altering the blood feeding 
behavior of fleas that feed from the treated animal (e.g., fleas take up less volume per 
feeding or feed less frequently), (f) reducing the viability of flea larvae, for example due 
to the feeding of larvae from feces of fleas that feed from the treated animal, (g) altering 
the development of flea larvae (e.g., by decreasing feeding behavior, inhibiting growth, 
inhibiting (e.g., slowing or blocking) molting, and/or otherwise inhibiting maturation to 
adults), and/or (h) altering or decreasing the ability of fleas or flea larvae to digest a blood 
meal. 

In order to protect an animal from flea infestation, a therapeutic composition of 
the present invention is administered to the animal in an effective manner such that the 
composition is capable of protecting that animal from flea infestation. Therapeutic 
compositions of the present invention can be administered to animals prior to infestation 
in order to prevent infestation (i.e., as a preventative vaccine) and/or can be administered 
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to animals after infestation. For example, proteins, mimetopes thereof, and antibodies 
thereof can be used as immunotherapeutic agents. 

Therapeutic compositions of the present invention can be formulated in an 
excipient that the animal to be treated can tolerate. Examples of such excipients include 
water, saline, Ringer's solution, dextrose solution, Hank ! s solution, and other aqueous 
physiologically balanced salt solutions. Nonaqueous vehicles, such as fixed oils, sesame 
oil, ethyl oleate, or triglycerides may also be used. Other useful formulations include 
suspensions containing viscosity enhancing agents, such as sodium 
carboxymethylcellulose, sorbitol, or dextran. Excipients can also contain minor amounts 
of additives, such as substances that enhance isotonicity and chemical stability. 
Examples of buffers include phosphate buffer, bicarbonate buffer and Tris buffer, while 
examples of preservatives include thimerosal, or o-cresol, formalin and benzyl alcohol. 
Standard formulations can either be liquid injectables or solids which can be taken up in a 
suitable liquid as a suspension or solution for injection. Thus, in a non-liquid 
formulation, the excipient can comprise dextrose, human serum albumin, preservatives, 
etc., to which sterile water or saline can be added prior to administration. 

In one embodiment of the present invention, a therapeutic composition can 
include an adjuvant. Adjuvants are agents that are capable of enhancing the immune 
response of an animal to a specific antigen. Suitable adjuvants include, but are not 
limited to, cytokines, chemokines, and compounds that induce the production of 
cytokines and chemokines (e.g., granulocyte macrophage colony stimulating factor (GM- 
CSF), Flt-3 ligand, granulocyte colony stimulating factor (G-CSF), macrophage colony 
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stimulating factor (M-CSF), colony stimulating factor (CSF), erythropoietin (EPO), 
interleukin 2 (IL-2), interleukin-3 (IL-3), interleukin 4 (IL-4), interleukin 5 (IL-5) 3 
interleukin 6 (IL-6), interleukin 7 (IL-7), interleukin 8 (IL-8), interleukin 10 (EL- 10), 
interleukin 12 (IL-12), interferon gamma, interferon gamma inducing factor I (IGIF), 
transforming growth factor beta, RANTES (regulated upon activation, normal T cell 
expressed and presumably secreted), macrophage inflammatory proteins (e.g., MIP-1 
alpha and MDP-1 beta), and Leishmania elongation initiating factor (LEIF)); bacterial 
components (e.g., endotoxins, in particular superantigens, exotoxins and cell wall 
components); aluminum-based salts; calcium-based salts; silica; polynucleotides; toxoids; 
serum proteins, viral coat proteins; block copolymer adjuvants (e.g., Hunter's Titermax™ 
adjuvant (Vaxcel™, Inc. Norcross, GA), Ribi adjuvants (Ribi ImmunoChem Research, 
Inc., Hamilton, MT); and saponins and their derivatives (e.g., Quil A (Superfos Biosector 
A/S, Denmark). Protein adjuvants of the present invention can be delivered in the form 
of the protein themselves or of nucleic acid molecules encoding such proteins using the 
methods described herein. 

In one embodiment of the present invention, a therapeutic composition can 
include a carrier. Carriers include compounds that increase the half-life of a therapeutic 
composition in the treated animal. Suitable carriers include, but are not limited to, 
polymeric controlled release vehicles, biodegradable implants, liposomes, bacteria, 
viruses, other cells, oils, esters, and glycols. 

One embodiment of the present invention is a controlled release formulation that 
is capable of slowly releasing a composition of the present invention into an animal. As 
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used herein, a controlled release formulation comprises a composition of the present 
invention in a controlled release vehicle. Suitable controlled release vehicles include, but 
are not limited to, biocompatible polymers, other polymeric matrices, capsules, 
microcapsules, microparticles, bolus preparations, osmotic pumps, diffusion devices, 
5 liposomes, lipospheres, and transdermal delivery systems. Other controlled release 
formulations of the present invention include liquids that, upon administration to an 
animal, form a solid or a gel in situ. Preferred controlled release formulations are 
biodegradable (i.e., bioerodible). 

A preferred controlled release formulation of the present invention is capable of 
J3 10 releasing a composition of the present invention into the blood of the treated animal at a 

On constant rate sufficient to attain therapeutic dose levels of the composition to protect an 

j 8 * animal from flea infestation. The therapeutic composition is preferably released over a 

; *f period of time ranging from about 1 to about 12 months. A controlled release 

lI formulation of the present invention is capable of effecting a treatment preferably for at 

15 least about 1 month, more preferably for at least about 3 months, even more preferably for 
at least about 6 months, even more preferably for at least about 9 months, and even more 
preferably for at least about 12 months. 

Acceptable protocols to administer therapeutic compositions in an effective 
manner include individual dose size, number of doses, frequency of dose administration, 
20 and mode of administration. Determination of such protocols can be accomplished by 
those skilled in the art. A suitable single dose is a dose that is capable of protecting an 
animal from disease when administered one or more times over a suitable time period. 
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For example, a preferred single dose of a protein, mimetope or antibody therapeutic 
composition, including a recombinant protein vaccine, is from about 1 microgram (^ig) to 
about 10 milligrams (mg) of the therapeutic composition per kilogram body weight of the 
animal. Booster vaccinations can be administered from about 2 weeks to several years 

5 after the original administration. Booster administrations preferably are administered 
when the immune response of the animal becomes insufficient to protect the animal from 
disease. A preferred administration schedule is one in which from about 10 \ig to about 1 
mg of the therapeutic composition per kg body weight of the animal is administered from 
about one to about two times over a time period of from about 2 weeks to about 12 

10 months. Modes of administration can include, but are not limited to, subcutaneous, 
intradermal, intravenous, intranasal, oral, transdermal, intraocular, intranasal, 
conjunctival, and intramuscular routes. Methods of administration for other therapeutic 
compounds can be determined by one skilled in the art, and may include administration of 
a therapeutic composition one or more times, on a daily, weekly, monthly or yearly 

15 regimen; routes of administration can be determined by one skilled in the art, and may 
include any route. A preferred route of administration of an inhibitory compound when 
administering to fleas is a topical, or "spot-on" formulation administered to the body 
surface of the animal, so that a flea would encounter the inhibitory compound when 
attached to the animal; another preferred route of administration of an inhibitory 

20 compound is an oral formulation that, when fed to an animal, would enter the 

bloodstream of the animal, which would then be transferred to a flea while feeding from 
the animal. 
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A recombinant protein vaccine of the present invention comprises a 
recombinantly-produced flea HMT and/or HNC protein of the present invention that is 
administered to an animal according to a protocol that results in the animal producing a 
sufficient immune response to protect itself from a flea infestation. Such protocols can be 
determined by those skilled in the art. 

According to one embodiment, a nucleic acid molecule of the present invention 
can be administered to an animal in a fashion to enable expression of that nucleic acid 
molecule into a protective protein or protective RNA (e.g., antisense RNA, ribozyme, 
triple helix forms or RNA drug) in the animal. Nucleic acid molecules can be delivered 
to an animal in a variety of methods including, but not limited to, (a) administering a 
naked (i.e., not packaged in a viral coat or cellular membrane) nucleic acid as a genetic 
vaccine (e.g., as naked DNA or RNA molecules, such as is taught, for example in Wolff 
et al., 1990, Science 247, 1465-1468) or (b) administering a nucleic acid molecule 
packaged as a recombinant virus vaccine or as a recombinant cell vaccine (i.e., the 
nucleic acid molecule is delivered by a viral or cellular vehicle). 

A genetic (i.e., naked nucleic acid) vaccine of the present invention includes a 
nucleic acid molecule of the present invention and preferably includes a recombinant 
molecule of the present invention that preferably is replication, or otherwise 
amplification, competent. A genetic vaccine of the present invention can comprise one or 
more nucleic acid molecules of the present invention in the form of, for example, a 
dicistronic recombinant molecule. Preferred genetic vaccines include at least a portion of 
a viral genome, i.e., a viral vector. Preferred viral vectors include those based on 
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alphaviruses, poxviruses, adenoviruses, herpesviruses, picornaviruses, and retroviruses, 
with those based on alphaviruses, such as sindbis or Semliki forest virus, species-specific 
herpesviruses and poxviruses being particularly preferred. Any suitable transcription 
control sequence can be used, including those disclosed as suitable for protein production. 
Particularly preferred transcription control sequences include cytomegalovirus immediate 
early (preferably in conjunction with Intron-A), Rous sarcoma virus long terminal repeat, 
and tissue-specific transcription control sequences, as well as transcription control 
sequences endogenous to viral vectors if viral vectors are used. The incorporation of a 
"strong" polyadenylation signal is also preferred. 

Genetic vaccines of the present invention can be administered in a variety of 
ways, with intramuscular, subcutaneous, intradermal, transdermal, conjunctival, 
intraocular, intranasal and oral routes of administration being preferred. A preferred 
single dose of a genetic vaccine ranges from about 1 nanogram (ng) to about 600 j-tg 5 
depending on the route of administration and/or method of delivery, as can be determined 
by those skilled in the art. Suitable delivery methods include, for example, by injection, 
as drops, aerosolized and/or topically. Genetic vaccines of the present invention can be 
contained in an aqueous excipient (e.g., phosphate buffered saline) alone or in a carrier 
(e.g., lipid-based vehicles). 

A recombinant virus vaccine of the present invention includes a recombinant 
molecule of the present invention that is packaged in a viral coat and that can be 
expressed in an animal after administration. Preferably, the recombinant molecule is 
packaging- or replication-deficient and/or encodes an attenuated virus. A number of 
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recombinant viruses can be used, including, but not limited to, those based on 
alphaviruses, poxviruses, adenoviruses, herpesviruses, picornaviruses, and retroviruses. 
Preferred recombinant virus vaccines are those based on alphaviruses (such as Sindbis 
virus), raccoon poxviruses, species-specific herpesviruses and species-specific 
poxviruses. An example of methods to produce and use alphavirus recombinant virus 
vaccines are disclosed in U.S. Patent No. 5,766,602 to Xiong and Grieve, which is 
incorporated by reference herein in its entirety. 

When administered to an animal, a recombinant virus vaccine of the present 
invention infects cells within the immunized animal and directs the production of a 
protective protein or RNA nucleic acid molecule that is capable of protecting the animal 
from flea infestation as disclosed herein. For example, a recombinant virus vaccine 
comprising a flea HMT and/or HNC nucleic acid molecule of the present invention is 
administered according to a protocol that results in the animal producing a sufficient 
immune response to protect itself from flea infestation. A preferred single dose of a 
recombinant virus vaccine of the present invention is from about 1 x 10 4 to about 1 x 10 8 
virus plaque forming units (pfu) per kilogram body weight of the animal. Administration 
protocols are similar to those described herein for protein-based vaccines, with 
subcutaneous, intramuscular, intranasal, intraocular, conjunctival, and oral administration 
routes being preferred. 

A recombinant cell vaccine of the present invention includes recombinant ceils of 
the present invention that express at least one protein of the present invention. Preferred 
recombinant cells for this embodiment include Salmonella, E. coli, Listeria, 
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Mycobacterium, S. frugiperda, yeast, (including Saccharomyces cerevisiae and Pichia 
pastoris), BHK, CV-1, myoblast G8, COS (e.g., COS-7), Vero, MDCK and CRFK 
recombinant cells. Recombinant cell vaccines of the present invention can be 
administered in a variety of ways but have the advantage that they can be administered 
orally, preferably at doses ranging from about 10 8 to about 10 12 cells per kilogram body 
weight. Administration protocols are similar to those described herein for protein-based 
vaccines. Recombinant cell vaccines can comprise whole cells, cells stripped of cell 
walls or cell lysates. 

The efficacy of a therapeutic composition of the present invention to protect an 
animal from flea infestation can be tested in a variety of ways including, but not limited 
to, detection of protective antibodies (using, for example, proteins or mimetopes of the 
present invention), detection of cellular immunity within the treated animal, or challenge 
of the treated animal with the fleas to determine whether the treated animal is resistant to 
infestation. Challenge studies can include direct administration of fleas to the treated 
animal. In one embodiment, therapeutic compositions can be tested in animal models 
such as mice. Such techniques are known to those skilled in the art. 

One therapeutic composition of the present invention includes an inhibitor of flea 
HMT and/or HNC protein activity, i.e., a compound capable of substantially interfering 
with the function of a flea HMT and/or HNC protein susceptible to inhibition by an 
inhibitor of flea HMT and/or HNC protein activity. An inhibitor of HMT and/or HNC 
protein activity can be identified using flea HMT and/or HNC proteins of the present 
invention. An inhibitor of HMT and/or HNC protein function can be identified using flea 
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HMT and/or HNC proteins of the present invention. A preferred inhibitor of HMT and/or 
HNC protein function is a compound capable of substantially interfering with the 
function of a flea HMT and/or HNC protein and which does not substantially interfere 
with host animal proteins. As used herein, a compound that does not substantially inhibit 
5 or interfere with host animal proteins is one that, when administered to a host animal, the 
host animal shows no significant adverse effects attributable to the compound and which, 
when administered to an animal in an effective manner, is capable of protecting that 
% animal from flea infestation. 

yg One embodiment of the present invention is a method to identify a compound 

J3 10 capable of inhibiting HMT and/or HNC protein activity of a flea. Such a method 

liJ 

^ 1 includes the steps of (a) contacting (e.g., combining, mixing) an isolated flea HMT and/or 

Is. 

[7 HNC protein, preferably a C felis HMT and/or HNC protein of the present invention, 

i u 

lI with a putative inhibitory compound under conditions in which, in the absence of the 

M= compound, the protein has HMT and/or HNC protein activity, and (b) determining if the 

15 putative inhibitory compound inhibits the activity. HMT and/or HNC protein activity can 
be determined in a variety of ways known in the art, including but not limited to 
determining the ability of HMT and/or HNC protein to bind to or otherwise interact with 
a substrate. Such conditions under which a HMT and/or HNC protein has HMT and/or 
HNC protein activity include conditions in which a HMT and/or HNC protein has a 
20 correct three-dimensionally folded structure under physiologic conditions, i.e. 
physiologic pH, physiologic ionic concentrations, and physiologic temperatures. 
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Putative inhibitory compounds to screen include antibodies (including fragments 
and mimetopes thereof), putative substrate analogs, and other, preferably small, organic 
or inorganic molecules. Methods to determine HMT and/or HNC protein activity are 
known to those skilled in the art; see, for example, the Examples section of the present 
application. Methods to determine binding of a putative inhibitory compound to a HMT 
and/or HNC protein of the present invention are known to those of skill in the art and 
include, for example, determining changes in molecular mass using surface plasmon 
resonance (e.g., determining light scatter by an inhibitor of a HMT and/or HNC protein, 
before and after contacting the inhibitor or protein with a HMT and/or HNC protein or 
inhibitor, respectively) or screening for compounds that inhibit interaction between a 
HMT and/or HNC protein and a substrate. 

A preferred method to identify a compound capable of inhibiting HMT and/or 
HNC protein activity includes contacting an isolated flea HMT and/or HNC protein 
having an amino acid sequence selected from the group consisting of: (a) a protein 
comprising an amino acid sequence selected from the group consisting of SEQ ID NO:2, 
SEQ ID NO:8, SEQ ID NO:14 5 SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:32, SEQ ID 
NO:38, SEQ ID NO:44, SEQ ID NO: 154, SEQ ID NO: 160, SEQ ID NO: 163, SEQ ID 
NO: 169, SEQ ID NO: 1862, SEQ ID NO: 1868, SEQ ID NO: 1873, SEQ ID NO: 1879, 
SEQ ID NO:1883, SEQ ID NO:1888, SEQ ID NO: 1897, SEQ ID NO:1902, SEQ ID 
NO:1915, SEQ ID NO:1920, SEQ ID NO:1925, and/or SEQ ID NO:1930, and/or a 
protein encoded by a nucleic acid molecule of Table I, Table II, Table EI and/or Table 
IV; (b) a protein comprising an at least 25 consecutive amino acid portion identical in 
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sequence to a consecutive amino acid portion of a sequence as set forth in (a), wherein the 
protein has HMT and/or HNC protein activity; (c) a protein comprising a fragment of a 
protein as set forth in (a), wherein the fragment has an activity selected from the group 
consisting of binding to a HMT and/or HNC molecule and hydrolyzing a HMT and/or 
5 HNC protein substrate; and (d) a protein encoded by an allelic variant of a nucleic acid 
molecule that encodes any protein of (a), (b), or (c), with a putative inhibitory compound 
under conditions in which, in the absence of the compound, the protein has HMT and/or 
HNC protein activity; and determining if the putative inhibitory compound inhibits the 
activity. 

1 0 Another embodiment of the present invention is an assay kit to identify an 

inhibitor of a flea HMT and/or HNC protein of the present invention. This kit comprises 
an isolated flea HMT and/or HNC protein of the present invention, and a means for 
determining inhibition of an activity of flea HMT and/or HNC protein, where the means 
enables detection of inhibition. Detection of inhibition of flea HMT and/or HNC protein 

1 5 identifies a putative inhibitor to be an inhibitor of flea HMT and/or HNC protein. Means 
for determining inhibition of flea HMT and/or HNC protein include an assay system that 
detects binding of a putative inhibitor to a flea HMT and/or HNC molecule, and an assay 
system that detects interference by a putative inhibitor of the ability of flea HMT and/or 
HNC protein to hydrolyze a substrate. Means and methods are described herein and are 

20 known to those skilled in the art. 

The following examples are provided for the purposes of illustration and are not 
intended to limit the scope of the present invention. The following examples include a 
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number of recombinant DNA and protein chemistry techniques known to those skilled in 
the art; see, for example, Sambrook et al., ibid. 
Example 1 

This Example describes the isolation of RNA from the hindgut and Malpighian 

5 tubules (HMT) of Ctenocephalides felis and the use of isolated RNA to construct 
subtracted and unsubtracted cDNA libraries. 

Approximately 10,000 hindguts and Malpighian tubules were dissected from 
equal numbers of cat blood fed and unfed adult C. felis with a male to female ratio of 1 to 
4, and total RNA was extracted using a guanidine isothiocyanate lysis buffer and the 

10 standard procedure described by Sambrook et al. Poly-A enriched mRNA was purified 
from total RNA above using a mRNA Purification Kit, available from Pharmacia 
Biotech, Piscataway, NJ, following the manufacturer's protocol. The same procedures 
were used to extract total RNA and isolate poly-A enriched mRNA from the dissected 
C felis bodies following removal of HMT, referred to hereinafter as "non-HMT mRNA". 

15 Poly-A enriched mRNA was used to construct a cDNA library using subtractive 

hybridization and suppression PCR as follows. Subtractive hybridization and suppression 
PCR was conducted using a PCR-Select™ cDNA Subtraction Kit, available from 
Clontech Laboratories, Inc., Palo Alto, CA according to the manufacturer's instructions. 
Briefly, this kit uses subtractive hybridization and suppression PCR to specifically 

20 amplify cDNA sequences that are present in the tester cDNA and absent in the driver 
cDNA, thus enriching for tester-specific sequences. The efficiency of the subtraction 
process can be assessed by semi-quantitative PCR and by comparing the ethidium 
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bromide staining patterns of the subtracted and unsubtracted samples on agarose gels as 
described in section V.D. of the manufacturer's protocol. For the semi-quantitative PCR, 
three genes with mRNAs known to be expressed outside of the HMT tissue were used to 
test for specific subtraction. These genes encoded putative actin, N-aminopeptidase, and 
serine protease proteins. 

Subtractive hybridization and suppression PCR was conducted under the 
following conditions. Two micrograms (jig) of HMT mRNA was used as the template 
for synthesis of the tester material and 2 \ig of non-HMT mRNA was used as template for 
synthesis of the driver material in this reaction. The number of cycles used in the 
selective amplification steps was optimized using the manufacturer's protocols. 
Optimization resulted in the use of 24 rather than the standard 27 cycles of primary PCR 
in combination with 15 cycles of secondary PCR rather than the standard 12 cycles. 

The products from the suppressive PCR reaction were ligated into the pCR®2.1 
vector, available from Invitrogen, Carlsbad, CA, using an Original TA Cloning® Kit, 
available from Invitrogen. The ligation reaction was then used to transform INVccF* One 
Shot™ competent cells, available from Invitrogen, which were plated on Luria broth (LB) 
agar with 50 micrograms per milliliter (jig/ml) ampicillin, available from Sigma-Aldrich 
Co., St. Louis, MO, and 50 (ig/mi 5-bromo-4-chloro-3-indoyl p-D-galactopyranoside 
(X-Gal), available from Fisher Biotech, Fair Lawn, NJ. Transformed colonies were 
amplified and the DNA isolated using the standard alkaline lysis procedure described by 
Sambrook et al., ibid. 
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y3 



Automated cycle sequencing of DNA samples was performed using an ABI 
PRISM™ Model 377, available from Perkins Elmer, with XL upgrade DNA Sequencer, 
available from PE Applied Biosystems, Foster City, CA, after reactions were carried out 
using the PRISM™ Dye Terminator Cycle Sequencing Ready Reaction Kit or the PRISM™ 

5 dRhodamine Terminator Cycle Sequencing Ready Reaction Kit or the Prism™ BigDye™ 
Terminator Cycle sequencing Ready Reaction Kit, available from PE Applied 
Biosystems, following the manufacturer's protocol, hereinafter "standard sequencing 
methods". Sequence analysis was performed using the Mac Vector™ sequence analysis 
software, available from International Biotechnologies Inc., New Haven, CT, and the 

10 Wisconsin Package Version 9.0 sequence analysis software, available from Genetics 

Computer Group (GCG), Madison, WI, hereinafter referred to as GCG version 9.0, using 
default parameters. Each sequence read was trimmed of vector sequence at either end 
and submitted for a search through the National Center for Biotechnology Information 
(NCBI), National Library of Medicine, National Institute of Health, Baltimore, MD, using 

15 the BLAST network. This database includes SwissProt + PER + SPupdate + GenPept + 
GPUpdate + PDB databases. The search was conducted using the xBLAST function, 
which compares the translated sequences in all 6 reading frames to the protein sequences 
contained in the database. Clones with significant homology to sequences in the 
GenBank database were grouped according to proposed function and are listed in 

20 Table E. Clones with no significant homology to sequences in the GenBank database 
were searched manually for open reading frames and are listed in Table IV. 
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An unsubtracted HMT cDNA library was constructed as follows. Approximately 
10,000 HMT tissues were dissected from equal numbers of unfed and cat blood-fed adult 
C. felis with a male to female ratio of 1 :4. Total RNA was extracted using a guanidine 
isothiocyanate lysis buffer and procedures described in Sambrook et al., followed by 

5 isolation using a mRNA purification kit, available from Pharmacia, according to the 

manufacturer's protocols. The library was constructed with 5 |ag of isolated mRNA using 
a ZAP-cDNA® cDNA synthesis kit, and packaged using a ZAP-cDNA® Gigapack® 
gold cloning kit, both available from Stratagene, La Jolla, CA. The resultant HMT library 
was amplified to a titer of about 5 x 10 9 plaque forming units per milliliter (pfu/ml). 

10 Single clone excisions were performed using the Ex- Assist™ helper phage, available 
from Stratagene, and used to create double stranded plasmid template for sequencing 
using the manufacturer's protocols with the following exceptions. Following incubation 
of the SOLR cells with the cleared phage lysate, the mixture was used to inoculate LB 
broth, and the mix was incubated overnight and then subjected to mini-prep plasmid 

1 5 preparation and sequencing as described for the subtracted HMT library above. 
Example 2 

This Example describes the isolation of RNA from the head and nerve cord 
(HNC) of Ctenocephalides felis and the use of isolated RNA to construct subtracted and 
unsubtracted cDNA libraries. 
20 Approximately 4,000 heads and attached nerve cords, including the terminal 

abdominal ganglia were dissected from equal numbers of cat blood-fed and unfed adult 
C felis with a male to female ratio of 1 to 4, and total RNA was extracted using a 
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guanidine isothiocyanate lysis buffer and the standard procedure described by Sambrook 
et al. Approximately 618 ^ig of total RNA was recovered. Poly-A enriched mRNA was 
purified from total RNA above using a mRNA Purification Kit, available from 
Pharmacia, following the manufacturer's protocol. Approximately 13 \ig of mRNA was 

5 isolated. The same procedures were used to extract total RNA and isolate poly-A 
enriched mRNA from the dissected C felis bodies following removal of HNC tissues, 
referred to hereinafter as "non-HNC mRNA". 

Suppression sub tractive PCR was conducted as described in Example 1 using a 
PCR-Select™ cDNA Subtraction kit, available from Clontech, under the following 

10 conditions. Two micrograms (jig) of HNC mRNA was used as the template for synthesis 
of the tester material and 2 \ig of non-HMT mRNA was used as template for synthesis of 
the driver material in this reaction. The number of cycles used in the selective 
amplification steps was optimized using the manufacturer's protocols. Optimization 
resulted in the use of 24 rather than the standard 27 cycles of primary PCR in 

15 combination with either 12 or 15 cycles of secondary PCR. 

cDNA pools from various PCR cycling combinations were iigated into the TA 
vector using a TA cloning kit, available from Invitrogen. Aliquots of ligation reaction 
were transformed into Ultramax DH5«™ bacteria, available from Gibco-BRL, 
Gaithersburg, MD. Portions of the transformation mixes were used to inoculate LB broth 

20 cultures containing 100 jag/ml of ampicillin. The overnight cultures were plated to 
generate discreet colonies which were used individually for overnight cultures for 
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plasmid preps. Transformed colonies were amplified and the DNA isolated using the 
standard alkaline lysis procedure described by Sambrook et al., ibid. 

Automated cycle sequencing of DNA samples was performed using the standard 
sequencing methods described in Example 1. Sequence analysis was performed using the 
Mac Vector™ sequence analysis software, available from International Biotechnologies 
Inc., New Haven, CT, and the Wisconsin Package Version 9.0 sequence analysis 
software, available from Genetics Computer Group (GCG), Madison, WI, hereinafter 
referred to as GCG version 9.0, using default parameters. Each sequence read was 
trimmed of vector sequence at either end and submitted for a xBLAST search as 
described in Example 1. Clones with significant homology to sequences in the GenBank 
database were grouped according to proposed function and are listed in Table I. Clones 
with no significant homology to sequences in the GenBank database were searched 
manually for open reading frames and are listed in Table m. 

An unsubtracted cDNA library was constructed as follows. Approximately 6400 
head and nerve cords were dissected from C.felis and poly-A RNA was isolated as 
described above. About seven \ig of HNC poly-A RNA was used to construct a cDNA 
library using Stratagene's XZAP-cDNA Synthesis Kit and protocol. The resultant HNC 
library was amplified to a titer of about 5 x 10 9 plaque forming units per milliliter 
(pfu/ml). Single clone excisions were performed using the Ex-Assist helper phage, 
available from Stratagene, and used to create double stranded plasmid template for 
sequencing using the manufacturer's protocols with the following exceptions. Following 
incubation of the SOLR cells with the cleared phage lysate, the mixture was used to 
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inoculate LB broth, and the mix was incubated overnight and then subjected to mini-prep 
plasmid preparation and sequencing as described for the subtracted library above. 
Example 3 

This example describes the production of a C felis cDNA pool by Rapid 

5 Amplification of cDNA Ends (RACE cDNA pool). 

Total RNA was extracted from adult fed and unfed fleas as follows. 
Approximately 1000 adult fed fleas and 1000 adult unfed fleas were frozen on dry ice and 
separately ground into powder using a mortar and pestle and total RNA was extracted 
from each powder as follows. Ten ml of solution D (4 M guanidine isothiocyanate, 25 

10 mM Sodium Citrate pH 7.0, 1.5% Sarcosyl, 0.5 M 2-mercaptoethanol) were added to the 
powder and the suspension was mixed by shaking. One ml of 2M sodium acetate, pH 4.0 
and 3 ml of pH 4.7 phenol/chloroform/isoamyl alcohol (125:24:1), available from Sigma, 
were added and the suspension was mixed on a vortex shaker then incubated on ice for 15 
minutes. Following incubation, the mixture was centrifuged at 10,000 X g for 20 minutes 

15 and the supernatant was removed and extracted twice with pH 4.7 

phenol/chloroform/isoamyl alcohol. Next, an equal volume of isopropanol was added to 
the supernatant and incubated at -20°C for 2 hours followed by centrifugation at 10,000 X 
g for 20 minutes. Following centrifugation, the supernatant was removed and discarded 
and the pellet was washed in 70% ethanol and allowed to dry at room temperature. The 

20 pellet was resuspended in 10 mM Tris 1 mM EDTA pH 8.0. Spectrophotometer analysis 
indicated that the yield of total RNA from unfed fleas was 1 140 jig and the yield from fed 
fleas was 1500 ng. 
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Six-hundred ^g from each of the fed and unfed adult flea total RNA extractions 
were combined and mRNA was then extracted using a mRNA Purification Kit, available 
from Amersham Pharmacia Biotech, Piscataway, NJ, using the manufacture's protocol. 
Approximately 15-25 fig of mRNA were isolated based on spectrophotometer analysis 
and ethidium bromide staining. One jig of purified mRNA was used as template to 
construct a RACE cDNA pool using a Marathon cDNA Amplification Kit, available from 
Clontech Laboratories, Inc., Palo Alto, CA, according to the manufactured instructions. 
Example 4 

This example describes the cloning, sequencing, recombinant protein expression 
and purification of a C.felis allantoinase nucleic acid molecule of the present invention. 
This example also describes the expression of allantoinase mRNA in a variety of flea 
tissues. 

A TA clone from the HMT EST library described in Example 1 was sequenced 
using standard sequencing methods and shown to have significant homology to 
allantoinase genes. This clone was digested with EcoKL to excise an insert 682 
nucleotides in length, referred to as flea nucleic acid molecule nCfALN 682 . The insert 
was isolated by gel purification using a Gel Purification kit, available from Qiagen, 
Chatsworth, CA. Approximately 50 nanograms (ng) of purified nC£ALN 6g2 was used to 
construct a 32 P a-dATP labeled DNA probe using a Megaprime DNA labeling kit, 
available from Amersham, Arlington Heights, IL, using the manufacturer's protocols. 

The 32 P a-dATP labeled probe was used in a standard plaque lift hybridization 
procedure to isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library 
described in Example 1. The following hybridization conditions were used, hereinafter 
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referred to as "standard hybridization conditions". Filters were hybridized with about 
1 X 10 6 counts per minute (cpm) per ml of the probe in 5X SSPE, (see Sambrook et al., 
ibid.), 1.2% sodium dodecyl sulfate (SDS), 0.1 mg/ml salmon sperm DNA and 5X 
Denhardt's reagent, (see Sambrook et al, ibid.), at 55°C for about 14 hours. The filters 
5 were washed as follows: (a) 10 minutes with 5X SSPE and 1% SDS, (b) 10 minutes with 
2X SSPE and 1% SDS, (c) 10 minutes with IX SSPE and 0.5% SDS, and (d) 10 minutes 
with 0.5X SSPE and 1% SDS. All washes were conducted at 55°C Plaques that 
hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 

'% excision was performed using the Stratagene Ex-Assist™ helper phage system and 

1 1 

In 10 protocols, to convert a positive plaque to pBluescript™ plasmid DNA. Sequencing was 

01 conducted using standard sequencing methods following preparation of DNA with a 

Qiagen Qiaprep™ spin mini prep kit using the manufacturer's instructions and restriction 
[: enzyme digestion with about 1 ^1 of 20 U/^il each of EcoKL and Xhol, available from New 

u England Biolabs, Beverly, MA. A clone was isolated from a primary plaque, containing a 

15 nucleic acid molecule of about 2057 base pairs, referred to herein as nCfALN 2057 , having 
a nucleotide sequence denoted herein as SEQ ID NO: 1 . The complement of SEQ ID 
NO: 1 is represented herein as SEQ ID NO:3 . Sequencing of nCfALN 682 indicates that 
nCfALN 682 shared 100% identity with nucleotides 855 through 1536 of SEQ IDNO:l. 
Translation of SEQ ID NO:l suggests that nucleic acid molecule nCfALN 2057 
20 encodes a full-length allantoinase protein of 384 amino acids, referred to herein as 

PCfALN 384 , having an amino acid sequence represented by SEQ ID NO:2, assuming the 
initiation codon spans from nucleotide 152 through nucleotide 154 of SEQ ID NO:l and 
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the termination codon spans from nucleotide 1304 through nucleotide 1306 of SEQ ID 
NO: 1 . The coding region encoding PCfALN 3g4 , is represented by nucleic acid molecule 
nCfALN 1152 , having a coding strand with the nucleic acid sequence represented by SEQ 
ID NO:4 and a complementary strand with nucleic acid sequence represented by SEQ ID 

5 NO:6. The amino acid sequence of PCfALN 3g4 also represented as SEQ ID NO:5, 
predicts that PCfALN 384 has an estimated molecular weight of about 42.2 kilodaltons 
(kDa) and an estimated isoelectric point (pi) of about 6. 

Comparison of amino acid sequence SEQ ID NO:2 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:2 showed the most homology, i.e., about 

10 48.6% identity, with a Rana catesbeiana (bullfrog) allantoinase protein, GenBank 
Accession No. 458126. Comparison of SEQ ID NO:4 with nucleic acid sequences 
reported in GenBank indicates that SEQ ID NO:4 showed the most homology, i.e., about 
51% identity, with a Rana catesbeiana nucleic acid molecule, GenBank Accession 
number U03471. Percent identity calculations were performed using GCG version 9.0 

1 5 using default parameters. 

The coding region of nCfALN 2057 , i.e. SEQ ID NO:4, was PCR amplified from the 
pBluescript™ clone described above as the template, using sense primer ALN-FE, having 
nucleotide sequence 5* GCG GAT CCT ATG CTG AAT TGC AAG AAC CTT G 3 1 , 
having a BamHl site indicated in bold, designated herein as SEQ ID NO:37, and anti- 

20 sense primer ALN-RE, having nucleotide sequence 5' CAG GTA CCC TCT TTT AGA 
AGC ACC GGT CCC 3', having a Kpnl site indicated in bold, designated herein as SEQ 
ID NO:38. PCR reactions were performed using the following amplification cycles: (a) 
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one cycle at 95°C for thirty seconds; (b) thirty cycles at 95°C for twenty seconds, 50°C for 
twenty seconds, and 72°C for two minutes; and (c) one cycle at 72°C for five minutes, 
hereinafter referred to as "standard thermocycling conditions", in reactions containing 2.5 
mM MgCl 2 , 0.2 mM dNTPs, 1 (iM of each primer, 0.5 jil of 5U/)il Taq polymerase, 1 jal 
of 1 jag/jLil template, and 3 (il of 10X Taq buffer, hereinafter referred to as "standard PCR 
reaction conditions". The PCR product was digested with BamKL and Kpnl and ligated 
into the vector pTrcHisB, available from Invitrogen, that had been digested with BamHI 
and Kpnl and treated with alkaline phosphatase. The resulting recombinant molecule, 
referred to herein as pTrc-nCfALN 1152 , was transformed into E. coli strain BL21, 
available fromNovagen Inc., Madison, WI, to form recombinant cell E. co/z:pTrc- 
nCfALNf^ 

The recombinant cell was grown under standard conditions and then incubated in 
the presence of 0.5 ^iM isopropylthio-P-galactoside (IPTG) to induce expression of 
recombinant protein, predicted to be approximately 42.2 kDa. Expression was confirmed 
using Coomassie-blue-stained Tris-glycine gel and by Western blot using a T7 tag 
antibody, available fromNovagen, which showed expression of an about 55-kDa protein. 
The protein product was purified by liquid chromatography using a HiTrap™ chelating 
column charged with NiCl 2 , available from Pharmacia, and was shown to contain the His 
tag of the vector when subjected to automated protein sequencing by Edman degradation. 

A Northern Blot analysis was conducted as follows to determine whether 
allantoinase is expressed exclusively in HMT tissues. HMT tissues were dissected from 
1000 adult cat blood-fed C. felis having a male to female ratio of 1 :4. Total RNA was 
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separately extracted from HMT tissues and the HMT-less carcasses that resulted from 
these dissections as follows. The tissues were frozen at -80°C, ground into a powder with 
a mortar and pestle, and the powders were equally divided into four 2-ml eppendorf tubes 
each containing 1 ml of lysis buffer. The lysis buffer contained 4 M guanidinium 
thiocyanate, 25 mM sodium citrate, pH 7.0, 3% sarcosyl, 0.5M 2-mercaptoethanol, 0.1% 
antifoam, and 1 mM aurintricarboxylic acid, all available from Sigma Chemical 
Corporation, St. Louis, MO. After mixing, the tubes were spun at 14,000 rpm for 2 
minutes and the supernatants were transferred to separate 2 ml eppendorf tubes 
containing 250 jxl of phenol, available from Aldrich, Milwaukee, WI. After mixing, the 
tubes were spun at 14,000 rpm for 5 minutes and the supernatants were transferred to new 
2-ml tubes. This process was repeated 3 times until no proteinaceous matter was visible at 
the phenol/lysis buffer interface, then 250 (xl of chloroform was added to each tube and 
the contents mixed and spun at 14,000 rpm for 5 minutes followed by transferring the 
supernatant to a new tube. A volume of isopropanol equal to the volume of the 
supernatant was added to each tube and the tubes placed on ice for 5 minutes. The tubes 
were then spun at 14,000 rpm at room temperature for 15 minutes, the supernatants were 
removed and discarded and the remaining RNA pellets were washed with 70% ethanol 
and dried. The RNA pellets were resuspended in 100 |il of TE (10 mM Tris, 1 mM 
ethylenediaminetetraacetic acid (EDTA)). The quantity of RNA in each tube was then 
determined using a spectrophotometer. 

Approximately 10 \ig of each RNA was added to separate tubes containing 
18.75 \il of loading buffer, which consists of 50% formamide, 16% formaldehyde, 17% 
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water, 7% glycerol, 1 X MOPS buffer (a 1:20 dilution of 0.4 M 93-[N- 
morpholino]propanesulfonic acid (MOPS), 0.1 M sodium acetate, and 20 mM EDTA), 
10 jal ethidium bromide, and 10 (il bromophenol blue dye, all available from Sigma. The 
tubes were heated to 95°C for 2 minutes then placed on ice. The RNA samples were 
5 separated by gel electrophoresis on a 1.5% agarose gel with 3 .2% formaldehyde and 
1 X MOPS buffer; the gel was then soaked in water for 30 minutes prior to transfer to 
remove excess formaldehyde. The gel was then transferred using standard techniques, 

O 

yg described by Sambrook et aL, ibid, with 10 X SSPE as the transfer buffer onto Nytran® 

y3 nylon membrane, available from Schleicher and Schuell Inc., Keene, NH. The membrane 

Mi 

rl 10 was UV cross-linked using the Stratalinker®, available from Stratagene, then 

prehybridized at 42°C in 50% formamide, 5X SSPE, 1.2% SDS, 5X Denhardt's reagent, 

L 2.5 mM EDTA, and 100 jig/ml salmon sperm DNA. A probe comprising the allantoinase 

ry 

M EST nucleic acid molecule, nC£ALN 682 was labeled with a- 32 P-ATP using a DNA 

O 

^ labeling kit, available from Amersham and added to the buffer at a concentration of 

15 approximately 1 x 10 6 cpm/ml, and allowed to hybridize for 18 hours at 42°C. The blot 
was then washed as follows: 10 minutes at 42°C in 4X SSPE and 1% SDS; 10 minutes at 
42°C in 2X SSPE and 1% SDS; 10 minutes at 42°C with 0.5X SSPE and 0.5X SDS; and 
10 minutes at 42°C with 0.25X SSPE and 0.25% SDS. The blot was then exposed to film 
for 1 hour, and the film was developed using standard procedures. Analysis of the 
20 developed film revealed that allantoinase mRNA was present in HMT tissues but was not 
present in non-HMT tissues. 
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Northern Blot analysis was also conducted to determine whether allantoinase 
mRNA is expressed only in certain stages of the flea life cycle and whether allantoinase 
mRNA expression is influenced by feeding. Total RNA was extracted as described above 
from 1000 fleas at each of the following flea life stages; eggs, first instar larvae, third 
instar larvae, wandering larvae and pupae and from 1000 adult fleas under the following 
feeding conditions; unfed, fed on cat blood for 15 minutes, fed on cat blood for 2 hours, 
fed on cat blood for 8 hours, and fed on cat blood for 24 hours. 

Each RNA sample was separated by gel electrophoresis, transferred to nylon 
membrane and hybridized with a- 32 P-ATP labeled nCfALN 682 probe as described above. 
Analysis of the developed film revealed that allantoinase mRNA was expressed in all 
adult fleas tested regardless of feeding conditions and was expressed by all life stages 
except for eggs and pupae, the two life stages which do not feed or excrete urine. 
Example 5 

This example describes the cloning, sequencing, recombinant protein expression 
and purification of a G felis chitin-binding protein nucleic acid molecule. This example 
also describes the expression of chitin-binding protein mRNA in a variety of flea tissues. 

A TA clone from the HMT EST library described in Example 1 was sequenced 
using standard sequencing methods and shown to have homology to a chitinase-like gene 
from Bombyx mori (silkworm). This clone was digested with EcoKL to excise an insert 
about 429 nucleotides in length, referred to as chitin-binding protein (CBP) nucleic acid 
molecule nCfCBP 429 . The insert was isolated by gel purification using a Gel Purification 
kit, available from Qiagen. Approximately 50 ng of purified nCfCBP 429 was used to 
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construct a 32 P oc-dATP labeled DNA probe using a Megaprime DNA labeling kit, 
available from Amersham, using the manufacturer's protocols. 

The 32 P a-dATP labeled probe was used in a plaque lift hybridization procedure to 
isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library described in 
Example 1, using standard hybridization conditions described in Example 4. Plaques that 
hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 
excision was performed using the Stratagene Ex-Assist™ helper phage system and 
protocols, to convert a positive plaque to pBluescript™ piasmid DNA, and sequencing 
was conducted following preparation of DNA with a Qiagen Qiaprep™ spin mini prep kit 
using the manufacturer's instructions and restriction enzyme digestion with about 1 \il of 
20 U/jxl each of EcoRI and Xhol, available from New England Biolabs. A clone was 
isolated from a primary plaque, containing a nucleic acid molecule of about 1 128 base 
pairs, referred to herein as nCfCBP 1128 , having a nucleotide sequence denoted herein as 
SEQ ID NO:7. The complement of SEQ ID NO:7 is represented herein as SEQ ID NO:9. 
Sequencing of nCfCBP 429 indicated that nCfCBP 429 shares 100% identity with nucleotides 
148 through 576 of SEQ ID NO:7. 

Translation of SEQ ID NO:7 suggests that nucleic acid molecule nCfCBP 1I28 
encodes a full-length chitin-binding protein of 272 amino acids, referred to herein as 
PCfCfCBP 272 , having an amino acid sequence represented by SEQ ID NO:8, assuming the 
initiation codon spans from nucleotide 6 through nucleotide 8 of SEQ ID NO:7 and the 
termination codon spans from nucleotide 822 through nucleotide 824 of SEQ ID NO:7. 
The coding region encoding PCfCBP 2725 is represented by nucleic acid molecule 
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nCfCBP 816 , having a coding strand with the nucleic acid sequence represented by SEQ ID 
NO; 10 and a complementary strand with nucleic acid sequence represented by SEQ ID 
NO: 12. The amino acid sequence of PCfCBP 272 , also represented as SEQ ID NO: 11, 
predicts that PCfCBP 272 has an estimated molecular weight of about 30.6 kDa and an 
estimated pi of about 7.3. 

Comparison of amino acid sequence SEQ ID NO: 8 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO: 8 showed the most homology, i.e., about 
26% identity with zLucilia cuprina peritrophin-44 protein, GenBank Accession No. 
407976. Comparison of SEQ ID NO: 10 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 10 showed the most homology, i.e., about 40% with 
a Lucilia cuprina peritrophin-44 nucleic acid molecule, GenBank Accession number 
L25106. Percent identity calculations were performed using GCG version 9.0 using 
default parameters. 

A nucleic acid molecule comprising nucleotides 59 through 827 of SEQ ID NO:7, 
encoding a predicted mature flea chitin-binding protein, was PCR amplified from the 
pBluescript™ clone ciescribed above as the template, using sense primer CBP-FE, having 
nucleotide sequence 5' GGG GAT CCT GCT GAC AGG AAT TCG CCC AC 3', having 

A ^ 

a BamHL site indicated in bold, designated herein as SEQ ID NO(39>, and anti-sense 




primer CBP-RE, having nuclide sequence 5' CAT GGT ACC CCT GGT TTA AGC 
CTT ACT TAG C 3', having a Kptrt site indicated in bold, designated herein as SEQ ID 

tJ z X 

NO:B8J PCR reactions were performed using standard PCR reaction and thermocycling 
conditions described in Example 4. TheVcR product was digested with BamEl and 
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Kpnl and ligated into the vector pTrcHisB, available from Invitrogen, that had been 
digested with jBamHI and Kpnl and treated with alkaline phosphatase. The resulting 
recombinant molecule, referred to herein as pTrc-nCfCBP 769 , was transformed into E. coli 
strain BL21, available from Novagen, to form recombinant cell E. co//:pTrc-nCfCBP 769 . 
The recombinant cell was grown under standard conditions and then incubated in the 
presence of 0.5 |uM IFTG to induce expression of recombinant protein, predicted to be a 
protein of approximately 32 kDa. Expression of protein was confirmed using Coomassie- 
blue-stained Tris-glycinevgel and by Western blot using a T7 tag antibody which showed 
expression of an about 32-tDa protein. The protein product was purified by liquid 
10 chromatography using a HiTtap TM chelating column charged with NiCl 2 , available from 
Pharmacia, and was shown to contain the His tag of the vector when subjected to 
automated protein sequencing byEdman degradation. 

Northern Blot analysis was conducted as described in Example 4 to determine 
whether CBP mRNA is expressed in only HMT tissue, only in certain stages of the flea 
15 life cycle and whether CBP mRNA expression is influenced by feeding. Total RNA was 
extracted from flea tissues, life stages and feeding conditions as described in Example 4. 
Each RNA sample was separated by gel electrophoresis, transferred to a nylon membrane 
and hybridized with <x- 32 P-ATP labeled nCfCBP 429 under the Northern Blotting conditions 
described in Example 4. Analysis of the developed film revealed that CBP mRNA was 
20 expressed in HMT tissues but not in non-HMT tissues. CBP mRNA was also detected in 
all adult fleas tested regardless of feeding conditions but was not detected in any of the 
non-adult life stages. 
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Example 6 

This example describes the cloning and sequencing of a C.felis sodium/potassium 
ATPase, beta subunit nucleic acid molecule. 

A TA clone from the HMT EST library described in Example 1 was sequenced 
5 using standard sequencing methods and shown to have homology to the nervous system 
antigen 1 gene from Drosophila melanogaster. This clone was digested with EcoRI to 
excise an insert about 439 nucleotides in length, referred to as flea NKB nucleic acid 
molecule nCfNKAB 439 . The insert was isolated by gel purification using a Gel 
Purification kit, available from Qiagen. Approximately 50 ng of purified nCfNKAB 439 

10 was used to construct a 32 P a-dATP labeled DNA probe using a Megaprime DNA 
labeling kit, available from Amersham, using the manufacturer's protocols. 

The 32 P a-dATP labeled probe was used in a plaque lift hybridization procedure to 
isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library described in 
Example 1, using standard hybridization conditions described in Example 4. Plaques that 

15 hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 
excision was performed using the Stratagene Ex- Assist™ helper phage system and 
protocols, to convert a positive plaque to pBluescript™ plasmid DNA, and sequencing 
was conducted following preparation of DNA with a Qiagen Qiaprep™ spin mini prep kit 
using the manufacturer's instructions and restriction enzyme digestion with about 1 (il of 

20 20 U/jil each of EcoRI and Xhol, available from New England Biolabs. A clone was 

isolated from a secondary plaque, containing a nucleic acid molecule of about 1714 base 
pairs, referred to herein as nCfNKAB l7]4 , having a nucleotide sequence denoted herein as 
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SEQ ID N0:13. The complement of SEQ ID NO:13 is represented herein as SEQ ID 
NO: 15. Sequencing of nCfNKAB 439 indicates that nCfNKAB 439 shared 100% identity 
with nucleotides 907 through 1345 of SEQ ID NO:13. 

Translation of SEQ ID NO: 13 suggests that nucleic acid molecule nCfNKAB 1714 

5 encodes a full-length NKAB protein of 326 amino acids, referred to herein as 

PCfNKAB 326 , having an amino acid sequence represented by SEQ ID NO: 14, assuming 
the initiation codon spans from nucleotide 294 through nucleotide 296 of SEQ ID NO: 13 
and the termination codon spans from nucleotide 1272 through nucleotide 1274 of SEQ 
ID NO: 13. The coding region encoding PCfNKAB 326 is represented by nucleic acid 

10 molecule nCfNKAB 97g , having a coding strand with the nucleic acid sequence represented 
by SEQ ID NO: 1 6 and a complementary strand with nucleic acid sequence represented by 
SEQ ID NO: 18. The amino acid sequence of PCfNKAB 326 , also represented by SEQ ID 
NO: 17 predicts that PCfNKAB 326 has an estimated molecular weight of about 37.7 kDa 
and an estimated pi of about 5. 

15 Comparison of amino acid sequence SEQ ID NO: 14 with amino acid sequences 

reported in GenBank indicates that SEQ ID NO: 14 showed the most homology, i.e., about 
46% identity, with a Drosophila melanogaster nervous system antigen 2 protein, 
GenBank Accession No. 88 1344. Comparison of SEQ ID NO: 16 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO: 16 showed the most 

20 homology, i.e., about 52% identity, with a Drosophila melanogaster nervous system 
antigen 2 nucleic acid molecule, GenBank Accession number U22440. Percent identity 
calculations were performed using GCG version 9.0 using default parameters. 
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Example 7 

This example describes the cloning and sequencing of a C. felis ligand-gated ion 
channel nucleic acid molecule. This example also describes the expression of ligand- 
gated ion channel mRNA in a variety of flea tissues. 

A TA clone from the HMT EST library described in Example 1 was sequenced 
using standard sequencing methods and shown to have homology to a human ligand- 
gated chloride channel nucleic acid molecule. The clone was digested with EcoKL to 
excise an insert about 376 nucleotides in length, referred to as flea LGIC nucleic acid 
molecule nCfLGIC 376 . The insert was isolated by gel purification using a Gel Purification 
kit, available from Qiagen. Approximately 50 ng of purified nCfLGIC 376 was used to 
construct a 32 P a-dATP labeled DNA probe using a Megaprime DNA labeling kit 
available from Amersham, using the manufacturer's protocols. 

The 32 P a-dATP labeled probe was used in a plaque lift hybridization procedure to 
isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library described in 
Example 1, using standard hybridization conditions described in Example 4. Plaques that 
hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 
excision was performed using the Stratagene Ex- Assist™ helper phage system and 
protocols, to convert a positive plaque to pBluescript™ plasmid DNA and sequencing 
was conducted following preparation of DNA with a Qiagen Qiaprep™ spin mini prep kit 
using the manufacturer's instructions and restriction enzyme digestion with about 1 \il of 
20 XJ/\il each of EcoKL and Xhol, available from New England Biolabs. A clone was 
isolated from a secondary plaque, containing a nucleic acid molecule of about 2240 base 
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pairs, referred to herein as n nCfLGIC 2240 , having a nucleotide sequence denoted herein as 
SEQ ID NO: 19. The complement of SEQ ID NO: 19 is represented herein as SEQ ID 
NO:21. Sequencing of nCfLGIC 376 indicates that nCfLGIC 376 shared 100% identity with 
nucleotides 763 through 1138 of SEQ ID NO: 19. 

5 Translation of SEQ ID NO: 1 9 suggests that nucleic acid molecule nCfLGIC 22 4 0 

encodes a partial-length LGIC protein of 569 amino acids, referred to herein as 
PCfLGIC 569 , having an amino acid sequence represented by SEQ ID NO:20, assuming the 
initiation codon spans from nucleotide 1 through nucleotide 3 of SEQ ID NO: 19 and the 
termination codon spans from nucleotide 1708 through nucleotide 1710 of SEQ ID 

10 NO: 19. The coding region encoding PCfLGIC 569 , is represented by nucleic acid molecule 
nCfLGIC 1707 , having a coding strand with the nucleic acid sequence represented by SEQ 
ID NO: 22 and a complementary strand with nucleic acid sequence represented by SEQ ID 
NO:24. The amino acid sequence of PCfLGIC 569 also represented as SEQ ID NO:23, 
predicts that PCfLGIC 569 has an estimated molecular weight of about 64 kDa and an 

1 5 estimated pi of about 6.6. 

Comparison of amino acid sequence SEQ ID NO:20 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:20 showed the most homology, i.e., about 
23% identity, with a Rattus norvegicus glycine receptor alpha-3 chain precursor protein, 
GenBank Accession No. 121580. Comparison of SEQ ID NO:22 with nucleic acid 

20 sequences reported in GenBank indicates that SEQ ED NO: 22 showed the most 

homology, i.e., about 38% identity, with a human glycine receptor alpha-3 subunit nucleic 
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acid molecule, GenBank Accession number AF017715. Percent identity calculations 

were performed using GCG version 9.0 using default parameters. 

Northern Blot analysis was conducted as described in Example 4 to determine 
whether LGIC mRNA is expressed in only HMT tissue. Total RNA was extracted from 
HMT tissues and non-HMT tissues as described in Example 4. Each RNA sample was 
separated by gel electrophoresis, transferred to nylon membranes and hybridized with cc- 
32 P-ATP labeled nCfLGIC 376 under the Northern Blotting conditions described in 
Example 4. Analysis of the developed film revealed that LGIC mRNA was expressed in 
HMT tissues but not in non-HMT tissues. 

Additional nucleic acid sequence corresponding to the coding regions at the 5' end 
of the LGIC cDNA described above was isolated by PCR using the RACE cDNA pool 
prepared as described in Example 3 as the template. A first PCR reaction was conducted 
using reverse primer LGIC-R4, which is complementary to nucleotides 200-223 of SEQ 
ID NO: 19, having a nucleic acid sequence 5* GCG ATA CTG GTG GTA CTG GTG 
AAG 3', denoted herein as SEQ ID NO: 1932 was used with the forward linker primer 
Adapter Primer 1, having a nucleic acid sequence 5* CCA TCC TAA TAC GAC TCA 
CTA TAG GGC 3', denoted herein as SEQ ID NO: 1933 using standard PCR reaction 
conditions and the following thermocycling conditions: (1) 94°C for 30 seconds, (2) 5 
cycles of 94°C for 10 seconds then 72°C for 4 minutes, (3) 5 cycles of 94°C for 10 
seconds then 70°C for 4 minutes, and (4) 25 cycles of 94°C for 10 seconds then 68°C for 
4 minutes. The reaction product was separated on a 1.5% agarose gel and stained by 
ethidium bromide, but no clear bands were seen. The first PCR reaction product was 
diluted 1 :50 in water and used as template for a second PCR reaction using reverse 
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primer LGIC-R5, which is complementary to nucleotides 88-1 10 of SEQ ID NO: 19, 
having a nucleic acid sequence 5' GAG GTG GTT GTC TTC AGT GGT TG 3', denoted 
herein as SEQ ID NO: 1934 and forward Adapter Primer 2, having a nucleic acid 
sequence 5' ACT CAC TAT AGG GCT CGA GCG GC 3', denoted herein as SEQ ID 

5 NO: 1935 under the same reaction conditions described for the first PCR reaction. The 
reaction product was separated by electrophoresis on a 1.5% agarose gel and stained with 
ethidium bromide revealing an approximately 700 bp band. This band was cut from the 
gel and purified using the QIAquick Gel Extraction Kit, then ligated into the pCR II TA 
Cloning vector, available from Invitrogen Corporation, Carlsbad, CA, using the 

10 manufacture's protocol. This clone, referred to herein as nCfLGIC 613 and having a coding 
sequence denoted SEQ ID NO: 1859, and a complementary strand denoted herein as SEQ 
ID NO: 1860 was sequenced using an ABI PRISM 377 automatic DNA Sequencer, 
available from Perkin Elmer, Branchburg, NJ. Sequence analysis revealed that 
nucleotides 503-613 of nCfLGIC 613 had 100% identity with nucleotides 1-1 10 of SEQ ID 

15 NO: 19. The two sequences were aligned to form a 2739 nucleotide contiguous sequence, 
referred to herein as nCfLGIC 2739 , having a coding strand denoted herein as SEQ ID 
NO: 1861 and a complementary strand denoted herein as SEQ ID NO:1863. Translation 
of SEQ ID NO: 1861 suggests that nucleic acid molecule nCfLGIC 2739 encodes a full- 
length LGIC protein of 672 amino acids, referred to herein as PCfLGIC 672 , having an 

20 amino acid sequence represented by SEQ ID NO: 1862, assuming the initiation codon 
spans from nucleotide 191 through nucleotide 193 of SEQ ID NO: 1861 and the 
termination codon spans from nucleotide 2207 through nucleotide 2209 of SEQ ID 
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NO: 1 861 . The coding region encoding PCfLGIC 672 , is represented by nucleic acid 
molecule nCfLGIC 2016 , having a coding strand with the nucleic acid sequence represented 
by SEQ ID NO: 1864 and a complementary strand with nucleic acid sequence represented 
by SEQ ID NO:1866. The amino acid sequence of PCfLGIC 672 i.e. SEQ ID NO:1862, 
predicts that PCfLGIC 672 has an estimated molecular weight of about 75.5 kDa and an 
estimated pi of about 5.89. 

Comparison of amino acid sequence SEQ ID NO: 1862 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO: 1862 showed the most homology, i.e., 
31.4% identity with glycine receptor Alpha 3 chain precursor cDNA from Rattus 
norvegicus (Accession # P24524). Comparison of SEQ ID NO: 1 864 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO: 1864 showed the most 
homology, i.e., about 43.1% identity with the Homo sapiens glycine receptor, alpha3 
cDNA (Accession # NP0O6520). Percent identity calculations were performed using 
GCG version 9.0 using default parameters. 

A LGIC nucleic acid molecule for recombinant expression of the predicted 
extracellular domain was produced as follows. In order to ligate the region encoding the 
predicted extracellular domain of the LGIC cDNA into the InsectSelect™ expression 
vector pIB/V5-His, two separate but overlapping DNA fragments were generated to be 
used as the template in the PCR overlap extension. To generate a 3' DNA fragment, a 
first PCR reaction was conducted using forward primer LGIC-ECD-D2F, which 
corresponds to nucleotides 2-25 of SEQ ID NO: 19, having a nucleic acid sequence 5' 
CAA TTT TAA ACG CAT CCA CGA CCG 3', denoted herein as SEQ ID NO: 1936, and 
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reverse primer LGIC-ECD-RE, which is complementary to nucleotides 937-961 of SEQ 
ID NO: 19, having a nucleic acid sequence 5' CCG CTC GAG CGA CCC ATT TCA 
CGA CTT ATT TGA ATC G 3\ denoted herein as SEQ ID NO: 1 937 and having a Xhol 
site indicated in bold, to amplify nucleotides 2-963 from SEQ ID NO: 19 which was used 
as template under standard PCR reaction conditions and the following thermocycling 
conditions: (1) 94°C for 30 seconds, (2) 25 cycles of 94°C for 10 seconds, 55°C for 10 
seconds, and 72°C for 3 minutes. The products of this reaction were separated on a 1.5% 
agarose gel, and a band corresponding to an approximately 960 nucleotide molecule was 
cut from the gel and purified using the QIAquick Gel Extraction Kit as described above. 
To generate a 5' cDNA fragment, a second PCR reaction was conducted using reverse 
primer LGIC-R5 (SEQ ID NO: 1934) and forward primer LGIC-ECD-FE, which 
corresponds to nucleotides 188-215 of SEQ ID NO: 1859, having a nucleic acid sequence 
5* GGA ATT CTA AAA TGC ACA ACA AAA TCC TGG TCC TGG 3\ denoted herein 
as SEQ ID NO:1938, and having an EcoKL site indicated in bold, using SEQ ID NO:1859 
as the template under the thermocycling conditions described for generating the 3' 
fragment. The products of this reaction were separated on a 1.5% agarose gel, and a band 
corresponding to an approximately 425 nucleotide molecule was cut from the gel and 
purified using the QIAquick Gel Extraction Kit as described above. 

For the PCR overlap extension reaction, the 5' and 3* cDNA fragments described 
above were used as the template in a PCR reaction with forward primer LGIC-ECD-FE 
and reverse primer LGIC-ECD-RE under the thermocycling conditions described for 
generating the 5' and 3' fragments. The products of this reaction were separated on a 
1 .5% agarose gel, and a band corresponding to an approximately 1300 nucleotide 
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molecule, as visualized by agarose gel electrophoresis and ethidium bromide staining, 
referred to herein as nCfLGIC 1300 , was cut from the gel and purified using the QIAquick 
Gel Extraction Kit as described above. 

The product of the PCR overlap extension reaction was the digested with EcoKL 

5 and Xhol restriction endonucleases, available from New England BioLabs, Inc., Beverly, 
MA, for 18 hours at 37 * . The digestion product was purified using the QIAquick 
Nucleotide Removal Kit, available from Qiagen, and ligated into the vector pIB/V5-His 
which had also been digested with EcoKL and Xhol and treated with shrimp alkaline 
phosphatase, available from New England BioLabs, Inc. for 30 minutes at 37 * . 

10 Following standard transformation procedures, a bacterial clone containing the plasmid 
pIB/V5-His-nCfLGIC 1300 was isolated. DNA sequence analysis of pIB/V5-His- 
nCfLGIC 1300 confirmed that nucleotides 188-1464 of SEQ ID NO: 1861 had been 
successfully ligated into the pIB/V5-His expression vector in frame with the C-terminal 
V5 epitope encoded by the vector. 

15 Example 8 

This example describes the cloning and sequencing of a C.felis ANON/23DA 
nucleic acid molecule. This example also describes the expression of ANON/23DA 
mRNA in a variety of flea tissues. 

A TA clone from the HMT EST library described in Example 1 was sequenced 
20 using standard sequencing methods and shown to have homology to an ANON/23DA 
gene from Drosophila melanogaster. This clone was digested with EcoRI to excise an 
insert about 177 nucleotides in length, referred to as flea ANON nucleic acid molecule 
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nCfANON 177 . The insert was isolated by gel purification using a Gel Purification kit, 
available from Qiagen. Approximately 50 ng of purified nC£ANON 177 was used to 
construct a 32 P a-dATP labeled DNA probe using a Megaprime DNA labeling kit, 
available from Amersham, using the manufacturer's protocols. 
5 The 32 P a-dATP labeled probe was used in a plaque lift hybridization procedure to 

isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library described in 
Example 1, using standard hybridization conditions described in Example 4. Plaques that 
hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 
excision was performed using the Stratagene Ex- Assist™ helper phage system and 

10 protocols, to convert a positive plaque to pBluescript™ plasmid DNA and sequencing of 
DNA was conducted following preparation with a Qiagen Qiaprep™ spin mini prep kit 
using the manufacturer's instructions and restriction enzyme digestion with about 1 |il of 
20 U/n-1 each of EcoRL and Xhol, available from New England Biolabs. A clone was 
isolated from a secondary plaque, containing a nucleic acid molecule of about 1429 base 

15 pairs, referred to herein as nCfANON 1429 , having a nucleotide sequence denoted herein as 
SEQ ID NO:25. The complement of SEQ ID NO:25 is represented herein as SEQ ID 
NO:27. Sequencing of nC£ANON l77 indicates that nCfANON 177 shared 100% identity 
with nucleotides 279 through 455 of SEQ ID NO:25. 

Translation of SEQ ID NO:25 suggests that nucleic acid molecule nCfANON 1429 

20 encodes a full-length ANON protein of 398 amino acids, referred to herein as 

PCfANON 398 , having an amino acid sequence represented by SEQ ID NO:26, assuming 
the initiation codon spans from nucleotide 18 through nucleotide 20 of SEQ ID NO: 25 
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and the termination codon spans from nucleotide 1212 through nucleotide 1214 of SEQ 
ID NO:25. The coding region encoding PCfANON 398 , is represented by nucleic acid 
molecule nCfANON ll94 , having a coding strand with the nucleic acid sequence 
represented by SEQ ID NO:28 and a complementary strand with nucleic acid sequence 
5 represented by SEQ ID NO:30. The amino acid sequence of PCfANON 398 , also 

represented as SEQ ID NO:29, predicts that PCfANON 398 has an estimated molecular 
weight of about 45 kDa and an estimated pi of about 8.8. 

Comparison of amino acid sequence SEQ ID NO:26 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:26 showed the most homology, i.e., about 
10 65% identity, with a Drosophila melanogaster ANON/23DA protein, GenBank 

Accession No. 924937. Comparison of SEQ ID NO:28 with nucleic acid sequences 



L. reported in GenBank indicates that SEQ ID NO:28 showed the most homology, i.e., about 

m 

M- 60% identity, with a Drosophila melanogaster ANON/23DA nucleic acid molecule, 



GenBank Accession number U29170. Percent identity calculations were performed using 

15 GCG version 9.0 using default parameters. 

Northern Blot analysis was conducted as described in Example 4 to determine 
whether ANON mRNA is expressed in only HMT tissue, only in certain stages of the flea 
life cycle and whether ANON mRNA expression is influenced by feeding. Total RNA 
was extracted from flea tissues, life stages and feeding conditions as described in 

20 Example 4. Each RNA sample was separated by gel electrophoresis, transferred to nylon 
membranes and hybridized with a- 32 P-ATP labeled nCfANON 177 under the Northern 
Blotting conditions described in Example 4. Analysis of the developed film revealed that 
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ANON mRNA was expressed in non-HMT tissues but not in HMT tissues. ANON 
mRNA was also detected in all adult fleas tested regardless of feeding conditions and in 
the wandering larvae and pupal life stages. 
Example 9 

This example describes the cloning and sequencing of a C.felis malvolio nucleic 
acid molecule. 

A TA clone from the HMT EST library described in Example 1 was digested with 
EcoKL to excise an insert about 432 nucleotides in length, referred to as nC£MALV 432 . 
The insert was isolated by gel purification using a Gel Purification kit, available from 
Qiagen and sequenced using standard sequencing methods and shown to have homology 
to a malvolio gene from Drosophila melanogaster, hereinafter referred to as a flea MALV 
nucleic acid molecule. 

Sequence information from nCfMALV 432 was used to design PCR primers to 
amplify a C.felis MALV nucleic acid molecule from the HMT unsubtracted library 
described in Example 1 using a nested PCR as follows. Sense primer MALV Rl, having 
the nucleotide sequence 5' CCA TTA TTA ACC TGG TCG ACC AC 3', designated SEQ 
ID NO:41 and corresponding to nucleotides 365-387 of nCfMALV 432 and reverse primer 
M13 Reverse, having the nucleotide sequence 5' GGA AAC AGT ATG ACC ATG 3', 
designated SEQ ID NO:42 were used in a first PCR reaction using HMT unsubtracted 
library as the template using standard PCR reaction and thermocycling conditions, with 
the exception that 2 of template was used. The reaction product from the first PCR 
reaction was diluted 1:50 and used as the template in a second PCR reaction as follows. 
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Reverse primer malvolio R2, having a nucleotide sequence 5' CGC TAT AGT CGG TAG 
GGT CGC 3 f , designated SEQ ID NO:43 and corresponding to nucleotides 239-259 of 
nCfMALV 432 and forward primer T3 , having a nucleotide sequence 5 ! AAT TAA CCC 
TCA CTA AAG GG 3 1 were used in a second PCR reaction under standard PCR reaction 

5 and thermocycling conditions. 

The second PCR reaction resulted in an approximately 1000 bp PCR product 
which was separated by electrophoresis on a 1.5% agarose gel, excised and purified using 
a Gel Purification Kit, available from Qiagen. The purified PCR product was ligated into 
the pCRff™, Original TA cloning vector, available from Invitrogen. The ligation 

10 reaction was then used to transform INVaF One Shot™ competent cells, available from 
Invitrogen, which were plated on LB agar with 50 micrograms per milliliter (|xg/ml) 
ampicillin, available from Sigma-Aldrich Co., and 50 |ig/ml X-Gal, available from Fisher 
Biotech. A clone was isolated from the ligation mix containing a nucleic acid molecule 
of about 765 base pairs, referred to herein as nCfMALV 765 , having a nucleotide sequence 

15 denoted herein as SEQ ID NO:31. The complement of SEQ ID NO:31 is represented 
herein as SEQIDNO:33. 

Translation of SEQ ID NO: 31 suggests that nucleic acid molecule nC£MALV 765 
encodes a partial-length MALV protein of 254 amino acids, referred to herein as 
PCfMALV 254 , having an amino acid sequence represented by SEQ ID NO:32, assuming 

20 the initiation codon spans from nucleotide 2 through nucleotide 4 of SEQ ID NO: 3 1 and 
the last codon spans from nucleotide 761 through nucleotide 763 of SEQ ID NO:3 1 . The 
coding region encoding PCfMALV 254 , is represented by nucleic acid molecule 
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nCfMALV 762 , having a coding strand with the nucleic acid sequence represented by SEQ 
ED NO:34 and a complementary strand with nucleic acid sequence represented by SEQ ID 
NO:36. The amino acid sequence of PCfMALV 2543 also represented as SEQ ID NO:35, 
predicts that PCfMALV 254 has an estimated molecular weight of about 36 kDa and an 

5 estimated pi of about 4.9. 

Comparison of amino acid sequence SEQ ID NO:32 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:32 showed the most homology, i.e., about 
71% identity, with a Drosophila melanogaster malvolio protein, GenBank Accession No. 
780776. Comparison of SEQ ID NO:34 with nucleic acid sequences reported in 

10 GenBank indicates that SEQ ID NO:34 showed the most homology, i.e., about 63% 
identity, with a Drosophila melanogaster malvolio nucleic acid molecule, GenBank 
Accession number U23948. Percent identity calculations were performed using GCG 
version 9.0 using default parameters. 
Example 10 

15 This example describes the cloning, sequencing, and recombinant expression of a 

C.felis odorant-binding protein-like (OS-D) nucleic acid molecule. This example also 
describes the expression of OS-D mRNA in a variety of flea tissues. 

A C. felis OS-D nucleic acid molecule of about 311 nucleotides was isolated from 
a cat blood-fed adult flea cDNA library, prepared as described in example 8 of PCT 

20 publication WO 96/1 1706 by Grieve et al., published April 25, 1996, by PCR 

amplification as follows. Sense primer 5'newBsaI5\ having a nucleotide sequence 5' 
CAA AAC TGG TCT CCC CGC TC 3', denoted SEQ ID NO:57 was used in 
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combination with vector primer T7, having a nucleic acid sequence 5' TAA TAC GAC 
TCA CTA TAG GG 3', denoted SEQ ID NO:58, in a first PCR reaction using the cat 
blood-fed adult flea cDNA library as the template under standard PCR reaction and 
thermocycling conditions. A 3 1 1-nucleotide fragment, denoted nCfOSD 311 was isolated 

5 and shown to encode a partial length protein of 45 amino acids having a sequence similar 
to Drosophila melanogaster OS-D protein. Since primer 5'newBsaI5 ! was designed to be 
specific for the C. felis serpin constant region, nCfOSD 311 is believed to have been 
fortuitously amplified in this PCR reaction. 

To isolate a flea OS-D nucleic acid molecule encoding a full-length OS-D protein, 

10 nucleic acid molecule nCfOSD 3n was used to design primers for a nested PCR as 

follows. Sense primer OSD-R1, having a nucleotide sequence 5 f GGT TCG CCT CTC 
TTC ACT TG 3', which is complementary in sequence to nucleotides 108-127 of 
nCfOSD 311 , denoted SEQ ID NO:59, was used in combination with Ml 3 reverse primer, 
SEQ ID NO:54, in a first PCR reaction using the cat blood-fed adult C felis cDNA 

1 5 library as the template. The product of the first reaction was diluted 1 :50 and used as the 
template for a second PCR reaction using reverse primer OSD-R2, having a nucleotide 
sequence 5' CGG TTG GAT CGT AAA CTG CAG 3 ! , which is complementary in 
sequence to nucleotides 52-72 of nCfOSD 311 , denoted SEQ ID NO:60, and forward 
primer T3, SEQ ID NO:56. Each PCR reaction was conducted under standard PCR 

20 reaction and thermocycling conditions with the exception that an annealing temperature 
of 55°C was used rather than 50°C. 
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A DNA fragment of about 365 nucleotides, referred to herein as nCfOSD 365 , was 
isolated from the second PCR product and purified using a Gel Purification Kit, available 
from Qiagen. The purified fragment was ligated into the pCRU™ TA cloning vector, 
available from Livitrogen, and sequenced using standard sequencing methods. 

5 Sequencing revealed that nucleotides 294-365 of nCfDSD 365 match nucleotides 1-72 of 
molecule nCfOSD 311 described above. The sequences from the partial length clones 
described were aligned to produce a sequence including a full-length coding region of 
604 nucleotides, referred to as nCfOSD 604 , denoted herein as SEQ ID NO:37, where 
nCfOSD 3U is identical in sequence to nucleotides 294-604 of SEQ ID NO:37 and 

10 nCfOSD 365 is identical in sequence to nucleotides 1-365 of SEQ ID NO:37. The 
complement of SEQ ID NO:37 is represented herein as SEQ ID NO:39. 

Translation of SEQ ID NO:37 suggests that nucleic acid molecule nCfOSD 604 
encodes a full-length OS-D protein of 135 amino acids, referred to herein as PCfOSD 135 , 
having an amino acid sequence represented by SEQ ID NO:38, assuming the initiation 

15 codon spans from nucleotide 26 through nucleotide 28 of SEQ ID NO:37 and the 

termination codon spans from nucleotide 431 through nucleotide 433 of SEQ ID NO:37. 
The coding region encoding PCfOSD 135 , is represented by nucleic acid molecule 
nCfOSD 405 , having a coding strand with the nucleic acid sequence represented by SEQ ID 
NO:40 and a complementary strand with nucleic acid sequence represented by SEQ ID 

20 NO:42. The amino acid sequence of PCfOSD 135 , also represented as SEQ ID NO:41, 
predicts that PCfOSD 135 has an estimated molecular weight of about 15 kDa and an 
estimated pi of about 8.6. Analysis of SEQ ID NO:38 suggests the presence of a signal 
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peptide encoded by a stretch of amino acids spanning from about amino acid 1 through 
about amino acid 20. The proposed mature protein, denoted herein as PCfOSD ll5 , 
contains about 115 amino acids corresponding to amino acids 21 through 135 of SEQ ID 
NO:38. The predicted pi of the mature protein (i.e. the protein with the signal peptide 
5 removed) is 6.6. 

Comparison of amino acid sequence SEQ ID NO:38 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:38 showed the most homology, i.e., about 

□ 

43 60% identity, with a Schistocerca gregaria chemosensory protein CSP-sg4, GenBank 

fj Accession No. 3283938. Comparison of SEQ ID NO:40 with nucleic acid sequences 

J7; 10 reported in GenBank indicates that SEQ ID NO:40 showed the most homology, i.e., about 

Z 58% identity, with a Schistocerca gregaria chemosensory protein CSP-sg4 nucleic acid 

H» molecule, GenBank Accession number AF070964. Comparison of SEQ ID NO:40 with 

i Li 

Hj nucleic acid molecules sequenced when screening the HNC subtracted and unsubtracted 

^ libraries described in Example 2 revealed that OS-D (i.e. SEQ ID NO:40) is expressed in 

15 each of these libraries. Additional sequence analysis revealed that there are four 

cysteines present in C.felis OS-D which are conserved in sequence alignments with the 
four cysteines of OS-D-like molecules of other insects, including D. melanogaser OS-D 
protein GenBank Accession No. U02546, S. gregaria chemosensory protein CSP-sg4, 
GenBank Accession number AF070964, and cockroach leg regenerative protein, 
20 GenBank Accession No. AF030340. Percent identity calculations and additional 
sequence analysis was performed using GCG version 9.0 using default parameters. 
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A nucleic acid molecule comprising nucleotides 91 through 447 of SEQ ID 
NO:37, encoding a predicted mature flea OS-D protein, was PCR amplified using the 
pBluescript™ clone described above as the template, using sense primer OSD-FE, having 
nucleotide sequence 5' CGC GGA TCC AGA AGA TAA ATA TAC TAG CAA ATT 
5 TGA TAA C 3\ having a BamHL site indicated in bold, designated herein as SEQ ID 
NO:61, and anti-sense primer OSD-RE, having nucleotide sequence 5 1 GAG GAA TTC 
CTC TTT TTG GAA ATT TAA ACT GTA ACG G 3', having an EcoW site indicated in 
y3 bold, designated herein as SEQ ID NO: 62. PCR reactions were performed using standard 

PCR reaction and thermocycling conditions described in Example 4; the product was 
rj 10 separated by agarose gel electrophoresis, and a fragment was excised and purified using a 

~ Gel Purification Kit, available from Qiagen. The fragment was digested with BamHL and 

M EcoRL and ligated into the vector pTrcHisB, available from Invitrogen, that had been 

digested with BarriHI and EcoKL and treated with alkaline phosphatase. The resulting 
^ recombinant molecule, referred to herein as pTrc-nCfOSD 357 , was transformed into E. 

15 coli strain BL21, available from Novagen, to form recombinant cell E. co/z :pTrc- 
nCfOSD 357 . 

The recombinant cell was grown under standard conditions then incubated in the 
presence of 0.5 mM IPTG to induce expression of recombinant protein, predicted to be 
approximately 17-kDa. Expression of protein was confirmed using Coomassie-blue- 
20 stained Tris-glycine gel and by Western blot using a T7 tag antibody which showed 
expression of an about 17 kDa protein. 
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A Northern Blot analysis was conducted as follows to determine whether OS-D 
mRNA is expressed exclusively in HNC tissues. HNC tissues were dissected from 1500 
adult cat blood-fed C.felis having a male to female ratio of 1:4. Total RNA was 
separately extracted from HNC tissues and the HNC-less carcasses that resulted from 
5 these dissections using a standard guanidine lysis method, described by Sambrook et al., 
ibid. 

Approximately 15 jig of each RNA were separated by electrophoresis on either 
Glyoxal gels with RNA prepared according to Burnett, Biotechniques, 22:4, pp. 668-671, 
1997, or formaldehyde gels with RNA prepared according to Sambrook et al., ibid. 

10 Following electrophoresis, RNA was blotted to Hybond N nylon membranes, available 
from Amersham, according to the protocols described in Burnett and Sambrook et el 
ibid. The membrane was UV cross-linked using the Stratalinker®, available from 
Stratagene, and placed in approximately 30 ml of hybridization buffer consisting of 5X 
SSPE, 1% Sarcosyl, 50% formamide, 5X Denhardt's reagent and 25 mM EDTA at 42°C 

15 for approximately 3 to 6 hours. A probe comprising the flea OS-D nucleic acid molecule 
nCfOSD 357 was labeled with ct- 32 P-ATP using a DNA labeling kit, available from 
Amersham and added to the buffer at a concentration of approximately 1 x 10 6 cpm/ml, 
and allowed to hybridize for about 14 to 18 hours at 42°C. The blot was then washed 
twice for 10 minutes per wash in 0.5X SSPE and 0.1% sarcosyl at 55°C and exposed to 

20 film for autoradiography. Analysis of the developed film showed that there was greater 
expression of OS-D mRNA in HNC tissues compared to non-HNC tissues, indicating 
possible upregulation of OS-D in flea head and nerve cords. 
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Example 1 1 

This example describes the cloning and sequencing of a C.felis N-methyl-D- 
aspartate receptor associated (NMD A) nucleic acid molecule. 

A TA clone from the HMT EST library described in Example 1 was sequenced 

5 using standard sequencing methods and shown to have significant homology to NMDA 
genes. This clone was digested with EcdRl to excise an insert 279 nucleotides in length, 
referred to as flea NMDA nucleic acid molecule nCfNMDA 279 . The insert was isolated 
by gel purification using a Gel Purification kit, available from Qiagen. Approximately 50 
ng of purified nCfNMDA 279 was used to construct a 32 P a-dATP labeled DNA probe 

10 using a Megaprime DNA labeling kit, available from Amersham, using the 
manufacturer's protocols. 

The 32 P a-dATP labeled probe was used in a plaque lift hybridization procedure to 
isolate a clone from the HMT lambda-ZAP unsubtracted cDNA library described in 
Example 1, using standard hybridization conditions described in Example 4. Plaques that 

15 hybridized strongly to the probe were isolated and subjected to in vivo excision. In vivo 
excision was performed using the Stratagene Ex- Assist™ helper phage system and 
protocols, to convert a positive plaque to pBluescript™ plasmid DNA and sequencing 
was conducted following preparation of DNA with a Qiagen Qiaprep™ spin mini prep kit 
using the manufacturer's instructions and restriction enzyme digestion with about 1 \A of 

20 20 U/pl each of EcoKL and Xhol, available from New England Biolabs. A clone was 

isolated from a secondary plaque, containing a nucleic acid molecule of about 1227 base 
pairs, referred to herein as nCfNMDA 1227 , having a nucleotide sequence denoted herein as 
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SEQ ID NO:43. The complement of SEQ ID NO:43 is represented herein as SEQ ED 
NO:45. Sequencing of nCfNMDA 279 indicates that nCfNMDA 279 shared 100% identity 
with nucleotides 709 through 987 of SEQ ID NO:43. 

Translation of SEQ ED NO:43 suggests that nucleic acid molecule nCfNMDA 1227 
encodes a full-length NMDA protein of 246 amino acids, referred to herein as 
PCfNMDA 246 , having an amino acid sequence represented by SEQ ID NO:44, assuming 
the initiation codon spans from nucleotide 312 through nucleotide 314 of SEQ ID NO:43 
and the termination codon spans from nucleotide 1050 through nucleotide 1052 of SEQ 
ID NO:43. The coding region encoding PCfNMDA 246 , is represented by nucleic acid 
molecule nCfNMDA 738 , having a coding strand with the nucleic acid sequence 
represented by SEQ ID NO:46 and a complementary strand with nucleic acid sequence 
represented by SEQ ID NO:48. The amino acid sequence of PCfNMDA 2465 also 
represented as SEQ ID NO:47 predicts that PCfNMDA 246 has an estimated molecular 
weight of about 27 kDa and an estimated pi of about 5.6. 

Comparison of amino acid sequence SEQ ED NO:44 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:44 showed the most homology, i.e., about 
34% identity, with a Emericella nidulans negative-acting regulatory protein, GenBank 
Accession No. 3676056. Comparison of SEQ ID NO:46 with nucleic acid sequences 
reported in GenBank indicates that SEQ ID NO:46 showed the most homology, i.e., about 
45% identity, with & Drosophila melanogasterNMDA nucleic acid molecule, GenBank 
Accession number L37377. Percent identity calculations were performed using GCG 
version 9.0 using default parameters. 
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Example 12 

This example describes the cloning and sequencing of C.felis chemical sense 
related lipophilic ligand binding protein nucleic acid molecule. This example also 
describes the expression of chemical sense related lipophilic ligand binding protein 
5 mRNA in a variety of flea tissues. 

A TA clone from the HNC EST library described in Example 2 was sequenced 
using standard sequencing methods and shown to have significant homology to chemical 
sense related lipophilic ligand binding protein (CLBP) genes. This clone was digested 
with EcoKL to excise an insert 339 nucleotides in length, referred to as flea CLBP nucleic 
10 acid molecule nCfCLBP 339 . The insert was isolated by gel purification using a Gel 
Purification kit, available from Qiagen, Chats worth, CA. Approximately 50 ng of 
purified nCfCLBP 339 was used to construct a 32 P a-dATP labeled DNA probe using a 
Megaprime DNA labeling kit, available from Amersham, using the manufacturer's 
protocols. 

15 The 32 P a-dATP labeled probe was used in a standard plaque lift hybridization 

procedure to isolate a clone from the HNC lambda-ZAP unsubtracted cDNA library 
described in Example 2. The following hybridization conditions were used. Filters were 
hybridized with about 5 X 10 7 counts per minute (cpm) per ml of the probe in 100 ml of 
buffer (5X SSPE, 1 % Sarcosyl, 0.1 mg/ml BLOTTO) at 45°C for about 14 hours. The 

20 filters were washed twice for 20 minutes per wash in 500 ml of 0.5X SSPE and 0.1% 

Sarcosyl at 55°C and subjected to autoradiography. Two plaques that hybridized strongly 
to the probe were isolated and subjected to in vivo excision using the Stratagene 
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Ex-Assist™ helper phage system and protocols. Miniprep DNA was prepared from each 
positive clone using a Quantum Prep mini prep kit, available from BioRad, Hercules, CA, 
and sequenced using standard sequencing procedures. Sequencing revealed that the two 
positive clones share 97% amino acid identity to each other. The first clone contained a 
5 nucleic acid molecule of about 633 nucleotides, referred to herein as nCfCLBPlA 633 , 
having a nucleotide sequence denoted herein as SEQ ED NO: 153. The complement of 
SEQ ID NO: 153 is represented herein as SEQ ID NO: 155. The second clone contained a 
nucleic acid molecule of about 63 1 nucleotides, referred to herein as nCfCLBP2A 631 , 
having a nucleotide sequence denoted herein as SEQ ID NO: 162. The complement of 

10 SEQ ID NO:162 is represented herein as SEQ ID NO:164. Sequencing of nCfCLBP 340 
indicated that nCfCLBP 339 shared 100% identity with nucleotides 1 through 339 of SEQ 
ID NO:153 and shared 100% identity with nucleotides 2 through 339 of SEQ ID NO:162. 

Translation of SEQ ID NO: 153 suggests that nucleic acid molecule 
nCfCLBPl A^ encodes a full-length CLBP protein of 147 amino acids, referred to herein 

15 as PCfCLBPlA 147 , having an amino acid sequence represented by SEQ ID NO: 154, 

assuming the initiation codon spans from nucleotide 67 through nucleotide 69 of SEQ ID 
NO: 153 and the termination codon spans from nucleotide 511 through nucleotide 513 of 
SEQ ID NO:153. The coding region encoding PCfCLBPlA 147 , is represented by nucleic 
acid molecule nCfCLBPl A 441 , having a coding strand with the nucleic acid sequence 

20 represented by SEQ ID NO: 156 and a complementary strand with nucleic acid sequence 
represented by SEQ ID NO:158. The amino acid sequence of PCfCLBPlA 147 , also 
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represented by SEQ ID NO: 157, predicts that PCfCLBPlA l47 has an estimated molecular 
weight of about 15 kDa and an estimated pi of about 5. 

Analysis of SEQ ID NO: 154 suggests the presence of a signal peptide encoded by 
a stretch of amino acids spanning from about amino acid 1 through about amino acid 19. 

5 The proposed mature protein, denoted herein as PCfCLBPl A 128 , contains 128 amino 
acids which is represented herein as SEQ ID NO: 160. PCfCLBPl A 128 is encoded by a 
nucleic acid molecule denoted nCfCLBPlA 384 having a coding strand with nucleic acid 
sequence SEQ ID NO: 159 and a complementary strand with SEQ ID NO: 161. 
Translation of SEQ ID NO: 162 suggests that nucleic acid molecule 

10 nCfCLBP2A 63l encodes a full-length CLBP protein of 147 amino acids, referred to herein 
as PCfCLBP2A 147 , having an amino acid sequence represented by SEQ ID NO: 163, 
assuming the initiation codon spans from nucleotide 65 through nucleotide 67 of SEQ ID 
NO: 162 and the termination codon spans from nucleotide 509 through nucleotide 5 1 1 of 
SEQ ID NO: 162. The coding region encoding PCfCLBP2A l47 , is represented by nucleic 

15 acid molecule nCfCLBP2A 441 , having a coding strand with the nucleic acid sequence 
represented by SEQ ID NO: 165 and a complementary strand with nucleic acid sequence 
represented by SEQ ID NO: 167. The amino acid sequence of PCfCLBP2A 147 predicts 
that PCfCLBP2A 147 has an estimated molecular weight of about 15 kDa and an estimated 
pi of about 5. 

20 Analysis of SEQ ID NO: 1 63 suggests the presence of a signal peptide encoded by 

a stretch of amino acids spanning from about amino acid 1 through about amino acid 19. 
The proposed mature protein, denoted herein as PCfCLBP2A 12g , contains about 128 
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amino acids which is represented herein as SEQ ID NO: 169. PCfCLBP2A 128 is encoded 
by a nucleic acid molecule denoted nCfCLBP2A 384 having a coding strand with nucleic 
acid sequence SEQ ID NO: 168 and a complementary strand with SEQ ID NO: 170. 

Comparison of amino acid sequences SEQ ID NO: 154 and SEQ ID NO: 163 with 
amino acid sequences reported in GenBank indicates that each sequence showed the most 
homology, i.e., about 29% identity, with a Drosophila melanogaster pheromone binding 
protein related protein 2 (PBPRP-2), GenBank Accession No. 1709595. Percent identity 
calculations were performed using GCG version 9.0 using default parameters. Blast 
comparison of nucleic acid sequences SEQ ID NO: 156 and SEQ ID NO: 165 with nucleic 
acid sequences reported in GenBank indicates that each sequence showed the most 
homology to a human Xp22 PAC PRCI1-5G1 1 nucleic acid molecule, GenBank 
Accession number AC002369. Pairwise identity could not be performed as the human 
clone in GenBank is too large to load into GCG version 9.0. Blast comparison performed 
using default parameters showed an insignificant level of identity of 0.87. Additional 
sequence analysis revealed that there are six cysteines present in C. felis CLBP which are 
conserved in sequence alignments with the six cysteines of neuronal/sense-related 
molecules in the PBP/GOBP family, including D. melanogaser PBPRP-2, GenBank 
Accession No. 1709595, and PBPRP-5, GenBank Accession No. P54195, proteins, and 
Phormia regina chemical sense related lipophilic ligand binding protein (CSRLLBP), 
GenBank Accession No. S65458. 
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A Northern Blot analysis was conducted to determine whether CLBP mRNA is 
expressed exclusively in HNC tissues. HNC tissues were dissected, total RNA was 
isolated and separated by electrophoresis as described in Example 10. 

Following electrophoresis, RNA was blotted as described in Example 10 and a 

5 probe comprising clone nCfCLBP 340 labeled with a- 32 P-ATP was added to the buffer at a 
concentration of approximately 1 x 10 6 cpm/ml and allowed to hybridize for about 14 to 
1 8 hours. The blot was then washed as described in Example 10 and exposed to film for 
autoradiography. Analysis of the developed film showed that there was greater 
expression of CLBP mRNA in HNC tissues compared to non-HNC tissues, indicating 

1 0 possible upregulation of CLBP in flea head and nerve cords. 

The coding region of nCfCLBP2A 631) i.e. SEQ ID NO: 162, was PCR amplified 
from the pBluescript™ clone described above as the template, using sense primer 
2AlBamSen having nucleotide sequence 5' ATG GAT CCG GCA AAA TAT ACC 
AAA GAA GAA G 3', having a BamHI site indicated in bold, designated herein as SEQ 

15 ID NO: 1952, and anti-sense primer 2AlantiRl, having nucleotide sequence 5' ATG 
A AT TCT TAT ATT GGT ATC GCG TCC ATT 3', having a EcoW site indicated in 
bold, designated herein as SEQ ID NO: 1953. PCR reactions were performed using the 
following thermocy cling conditions: (a) one cycle at 95°C for one minute; (b) five cycles 
at 94°C for ten seconds, 49°C for twenty-five seconds, and 69°C for one minute; and (c) 

20 twenty-three cycles at 94°C for ten seconds, 53°C for twenty seconds, and 69°C for 

seventy-five seconds, in reactions containing 0.2 mM dNTPs, 1 p.M of each primer, 0.5 ^il 
of 5U/(il KlenTaq Advantage polymerase, available from Clontech, 1 ^1 of 1 
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template, and IX KlenTaq buffer, hereinafter referred to as "standard PCR conditions". 
The PCR product was digested with BaniHL and EcoBI and ligated into the vector 
pTrcHisB, available from Invitrogen, that had been digested with BarriHI and EcoKL. The 
resulting recombinant molecule, referred to herein as pTrc- nCfCLBP2A 441 , was 
transformed into E. coli strain BL21, available from Novagen Inc., Madison, WI, to form 
recombinant cell E. co#:pTrc- nCfCLBP2A 441 . 

The recombinant cell was grown under standard conditions and then incubated in 
the presence of 0.5 |nM isopropylthio-p-galactoside (IPTG) to induce expression of 
recombinant protein. Expression was confirmed using Coomassie-blue-stained Tris- 
glycine gel and by Western blot using a T7 tag antibody, available from Novagen, which 
showed expression of an about 18 kDa protein. The protein product was purified as 
follows. The recombinant cells were collected by centrifugation, the supernatant was 
discarded and the pellets were resuspended and homogenized in 60ml (total) of 50mM 
Tris pH8.0 containing 50mM NaCl and ImM phenylmethylsulfonyl fluoride(PMSF). 
The sample was then passed through the microfluidizer five times, rocked at 4°C for 20 
minutes, and centrifuged at 20,000 x G for 30minutes. The supernatant was collected and 
filtered through a 0.45um filter then run over a HiTrap Chelating column, available from 
Amersham Pharmacia, in 50mM Tris pH8 containing 50mM NaCl and lOmM 
immidazole and eluted with an increasing imidazole gradient. The recombinant protein 
was eluted at approximately 150mM imidazole. Fractions containing recombinant 
protein were pooled and concentrated using a Centricon Plus-20 (Amicon), and 
diafiltered into PBS. Quantification of the protein was performed by densitometry 
against a known standard. 
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Example 13 

This Example describes the further characterization and expression of a 
Sodium/Hydrogen Transporter-like cDNA isolated by EST sequencing described in 
Example 1. 

5 A cDNA designated clone 223 1-94 was isolated from the unsubtracted HMT 

library as described in Example 1. Analysis of clone 2231-94 indicated that the cDNA, 
denoted nCfNAH 2080 , is about 2080 nucleotides in length, having a coding strand with 
nucleic acid sequence SEQ ID NO: 1867 and a complementary sequence having SEQ ID 
NO: 1869. Translation of SEQ ID NO: 1867 suggests that nucleic acid molecule 

10 nCfNAH 2080 encodes a full-length Sodium/Hydrogen Transporter-like protein of 608 

amino acids, referred to herein as PCfNAH 608 , having an amino acid sequence represented 
by SEQ ID NO: 1868, assuming the initiation codon spans from nucleotide 45 through 
nucleotide 47 of SEQ ID NO: 1867 and the termination codon spans from nucleotide 1869 
through nucleotide 1871 of SEQ ID NO:1867. The coding region encoding PCfNAH^, 

15 is represented by nucleic acid molecule nCfNAH 1824 , having a coding strand with the 

nucleic acid sequence represented by SEQ ID NO: 1870 and a complementary strand with 
nucleic acid sequence represented by SEQ ID NO: 1871. The amino acid sequence of 
SEQ ID NO: 1868, predicts that PCfNAH 608 has an estimated molecular weight of about 
67.9 kDa and an estimated isoelectric point (pi) of about 6.47. 

20 Comparison of amino acid sequence SEQ ID NO: 1868 with amino acid sequences 

reported in GenBank indicates that SEQ ID NO: 1868 showed the most homology, i.e., 
about 67.7% identity, with a sodium hydrogen exchanger NHE1 (Accession # 
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AAD32689. 1). Comparison of SEQ ID NO: 1 867 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 1867 showed the most homology, i.e., about 59.5% 
identity, with a Drosophila melanogaster sodium hydrogen exchanger NHE1 (Accession 
# AF142676). Percent identity calculations were performed using GCG version 9.0 using 
5 default parameters. 

In order to express the full-length putative NaH protein, the entire coding region 
was amplified by PCR and then ligated into the InsectSelect™ expression vector pIB/V5- 
J3 His, available from Invitrogen, as follows. Forward primer NaH-IS-FE, which 

y3 corresponds to nucleotides 42-74 of SEQ ID NO: 1867, having the sequence 5' GAC 

if"" 

J 1 0 TAG TAA AAT GGG CGT TAA AAA TAT ATA TTT ATA CTG C 3', denoted SEQ 

m 

; ID NO: 1939 and having a Spel site indicated in bold, was used in conjunction with 

L reverse primer NaH-IS-RE, which is complementary to nucleotides 1845-1867 of SEQ ID 

fy 

M NO: 1 867, having the sequence 5' CCG CTC GAG GTA CTG CAC GTA CTA ACG 

^ TCA TC 3\ denoted SEQ ID NO: 1940 and having a Xhol restriction site indicated in 

15 bold, in a PCR reaction using SEQ ID NO: 1867 as the template. Standard PCR reaction 
conditions were used with the following thermocy cling conditions: (1) 94°C for 30 
seconds, (2) 25 cycles of 94°C for 10 seconds, 55°C for 10 seconds and 72°C for 3 
minutes. The products of this reaction were separated on a 1.5% agarose gel, and a band 
corresponding to an approximately 1825 nucleotide molecule was cut from the gel and 
20 purified using the QIAquick Gel Extraction Kit as described above. The PCR product 
was then digested with Spel and Xhol restriction endonucleases for 18 hours at 37 ' . The 
digestion product was purified using the QIAquick Nucleotide Removal Kit, available 
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from Qiagen, and ligated into the vector pIB/V5-His which had also been digested with 
Spel and Xhol and treated with shrimp alkaline phosphatase, available from New England 
BioLabs, Inc., for 30 minutes at 37 * . Following standard transformation procedures, a 
bacterial clone containing the plasmid pIB/V5-His-NaH was isolated. DNA sequence 
analysis of the clone confirmed that nucleotides 42-1867 of SEQ ID NO: 1867, referred to 
herein as nCfNAH i826 , had been successfully ligated into the pEB/V5-His expression 
vector in frame with the C-terminal V5 epitope encoded by the vector. 

A Northern Blot analysis was conducted as described in Example 4 to determine 
whether NaH mRNA is expressed only in certain life stages of the flea life cycle and 
whether NaH mRNA is expressed only in HMT tissue. Total RNA was extracted from 
eggs, first, third, and wandering larvae, pupae, unfed adults, and adults fed on cat blood 
for 0.25, 2, 8, and 24 hours. In addition, total RNA was extracted from hindguts and 
Malpighian tubules extracted from 24 hour cat blood-fed adult fleas, and from the 
remaining body parts following the removal of hindguts and Malpighian tubules. Each 
RNA sample was separated by gel electrophoresis, transferred to nylon membranes and 
hybridized with a- 32 P-ATP labeled nCfNAH lg26 under the Northern Blotting conditions 
described in Example 4. Analysis of the developed film revealed that NAH mRNA was 
expressed in the 0.25, 2, and 8 hour adult fed timepoints only. 
Example 14 

This Example describes the further characterization of a Chloride Intracellular 
Channel-like cDNA isolated by EST sequencing described in Example 1. 
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A cDNA designated clone 2233-24 was isolated from the unsubtracted HMT 
library as described in Example 1. Analysis of clone 2233-24 indicated that the cDNA, 
denoted nCfCLIC 2283 is about 2283 nucleotides in length, having a coding strand with 
nucleic acid sequence SEQ ID NO: 1872 and a complementary sequence having SEQ ID 

5 NO: 1 874. Translation of SEQ ID NO: 1 872 suggests that nucleic acid molecule 

nCfCLIC2 283 encodes a full-length Chloride Intracellular Channel-like protein of 262 
amino acids, referred to herein as PCfCLIC 262 , having an amino acid sequence 
represented by SEQ ID NO: 1 873, assuming the initiation codon spans from nucleotide 60 
through nucleotide 62 of SEQ ID NO: 1872 and the termination codon spans from 

1 0 nucleotide 846 through nucleotide 848 of SEQ ID NO : 1 872. The coding region encoding 
PCfCLIC 262 , is represented by nucleic acid molecule nCfCLIC 786 , having a coding strand 
with the nucleic acid sequence represented by SEQ ID NO: 1875 and a complementary 
strand with nucleic acid sequence represented by SEQ ID NO: 1876. The amino acid 
sequence of SEQ ID NO: 1873, predicts that PCfCLIC 262 has an estimated molecular 

1 5 weight of about 30.2 kDa and an estimated isoelectric point (pi) of about 6.02. 

Comparison of amino acid sequence SEQ ID NO: 1873 with amino acid sequences 
reported in GenBank indicates that SEQ ED NO: 1873 showed the most homology, i.e., 
about 37.8% identity, with a Homo sapiens chloride intracellular channel 2 (Accession # 
NP001280. 1). Comparison of SEQ ID NO: 1 872 with nucleic acid sequences reported in 

20 GenBank indicates that SEQ ID NO:1872 showed the most homology, i.e., about 37.5% 
identity with a Homo sapiens chloride intracellular channel 2 (Accession # NM001289). 
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Percent identity calculations were performed using GCG version 9.0 using default 

parameters. 

Example 15 

This Example describes the further characterization of a Peritrophin-like cDNA, 
5 referred to herein as PL2, isolated by EST sequencing described in Example 1. 

A cDNA designated clone 2232-23 was isolated from the unsubtracted HMT 
library as described in Example 1 , denoted herein as SEQ ID NO: 1877. Analysis of 
clone 2232-23 indicated that the cDNA, denoted nCfPL2 457 is about 457 nucleotides in 
length. Translation of the coding strand of nCfPL2 457 suggests that nucleic acid molecule 
10 nCfPL2 457 encodes a partial-length Peritrophin-like protein of 1 13 amino acids, referred 
to herein as PCfPL2 U3 , assuming a stop coding at nucleotides 342-344 of nCfPL2 457 . 

Additional coding sequence corresponding to the 5' end of nCfPL2 457 was isolated 
by PCR performed using a RACE cDNA pool prepared as described in Example 3 as 
template. A first PCR reaction was performed using reverse primer PL2-R1, which is 
15 complementary to nucleotides 167-187 of the nCfPL2 457 cDNA, having a nucleic acid 
sequence 5' GTC TGG AAG CTC AGG AAG AGG 3', denoted herein as SEQ ID 
NO: 1941, in conjunction with forward Adapter Primer 1, SEQ ID NO: 1933, described 
above under the following thermocycling conditions: (1) 94°C for 30 seconds, (2) 5 
cycles of 94°C for 10 seconds and 72°C for 4 minutes, (3) 5 cycles of 94°C for 10 
20 seconds and 70°C for 4 minutes, and (4) 25 cycles of 94°C for 10 seconds then 68°C for 
4 minutes. The product of this reaction was diluted 1:50 and used as template for a 
second PCR reaction as follows. Forward adapter primer 2, SEQ ID NO: 1935, was used 
with reverse primer PL2-R2, which is complementary to nucleotides 29-52 of the 
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nCfPL2 457 cDNA, having a nucleic acid sequence 5' GTA ATA TGC GTG ACA ATC 
GTG TGG 3*, denoted herein as SEQ ID NO: 1942, using the thermocycling conditions 
described for the first PCR reaction. The resulting product was gel purified as described 
above to reveal a distinct band corresponding to nucleic acid molecule of approximately 

5 900 bp in length. The fragment was then ligated into the pCR II TA Cloning vector, 
available from Qiagen and sequenced using an ABI PRISM 377 automatic DNA 
Sequencer. Sequencing revealed that nucleotides 791-835 of the fragment had 100% 
identity with nucleotides 1-45 of the nCfPL2 457 cDNA. The two sequences were aligned 
to form a contiguous sequence, denoted nCfPL2 1291 , which is about 1291 nucleotides in 

10 length, having a coding strand with nucleic acid sequence SEQ ID NO: 1878 and a 
complementary sequence having SEQ ID NO: 1 879. Translation of SEQ ID NO: 1 878 
suggests that nucleic acid molecule nCfPL2 1291 encodes a non full-length Peritrophin-like 
protein of 391 amino acids. 

In order to isolate the additional sequence corresponding to the 5' end of SEQ ID 

1 5 NO: 1 878, nested PCR reactions were performed using the RACE cDNA pool as 

template. For the first PCR, forward adapter primer API was used with reverse primer 
PL2-R1 under standard PCR reaction conditions and the following thermocycling 
conditions: (1) 94 ' C for 1 minute, (2) 5 cycles of 94 ' C for 20 seconds and 70 ' C for 1 
minute, (3) 5 cycles of 94 * C for 20 seconds and 68 ' C for 1 minute, (4) 10 cycles of 

20 94 ' C for 20 seconds and 66 * C for 1 minute. The products of this reaction were diluted 
1:50 in water and used as template for the second, nested PCR. The second PCR reaction 
used forward adapter primer AP2 in conjunction with reverse primer PL2-R5, which is 
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complementary to nucleotides 70-93 of SEQ ID NO: 1878, having a nucleotide sequence 
5' CGG TGC AAG TTA TAG AAC CTT CCG 3', denoted herein as SEQ ID NO: 1943 
under standard PCR reaction conditions using the following thermocy cling conditions: 
(1) 94 ' C for 1 minute, (2) 5 cycles of 94 ' C for 20 seconds and 70 * C for 1 minute, (3) 5 
5 cycles of 94 * C for 20 seconds and 68 * C for 1 minute, (4) 40 cycles of 94 ' C for 20 

seconds and 66 ' C for 1 minute. The products of this reaction were separated by agarose 
gel electrophoresis and a band approximately 279 nucleotides in length was excised from 
/S the gel and purified as described above. The fragment, referred to as nCfPL2 279 having a 

U coding nucleic acid sequence designated SEQ ID NO: 1 880 and a complementary 

yj 1 0 sequence designated SEQ ID NO: 1 88 1 , was then ligated into the pCROII TA Cloning 

01 

I* s vector, available from Qiagen, and sequenced as described above. Sequencing revealed 

ry that nucleotides 228-279 of nCfPL2 279 were identical to nucleotides 42-93 of SEQ ID 

Li 

p NO: 1878, however, nucleotides 186-228 of nCfPL2 279 had no significant similarity to 

SEQ ID NO: 1878. This discrepancy may be the result of alternative RNA splicing or 

15 may be an artifact of the cDNA pool. To determine the reason for this discrepancy, 

additional fragments corresponding to this region were isolated by PCR from flea cDNA 
libraries from adult midguts, hindgut and Malpighian tubules and mixed instar larvae 
using techniques described herein. Sequence analysis of fragments obtained from these 
libraries revealed that these fragments were identical in sequence to the sequence of 

20 nCfPL2 279 , therefore, the region of SEQ ID NO: 1 878 which did not align to nCfPL2 279 
was deemed to be an artifact and was not used in subsequent alignments. 
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The PL2 sequences described above were aligned to form a contiguous sequence, 
denoted nCfPL2 1477 , which is about 1477 nucleotides in length, having a coding strand 
with nucleic acid sequence SEQ ID NO: 1882 and a complementary sequence having SEQ 
ED NO: 1884. Translation of SEQ ID NO: 1882 suggests that nucleic acid molecule 
nCfPL2 1477 encodes a full-length Peritrophin-like protein of 453 amino acids, referred to 
herein as PCfPL2 453 , having an amino acid sequence represented by SEQ ID NO: 1883, 
assuming an initiation codon spanning from nucleotide 3 through nucleotide 5 of SEQ ID 
NO: 1882 and a termination codon spanning from nucleotide 1362 through nucleotide 
1364 of SEQ ID NO: 1882. The coding region encoding PCfPL2 453 , is represented by 
nucleic acid molecule nCfPL2 1359 , having a coding strand with the nucleic acid sequence 
represented by SEQ ID NO: 1885 and a complementary strand with nucleic acid sequence 
represented by SEQ ID NO:1886. The amino acid sequence of SEQ ID NO:1883, 
predicts that PCfPL2 453 has an estimated molecular weight of about 49 kDa and an 
estimated isoelectric point (pi) of about 4.7. 

Comparison of amino acid sequence SEQ ID NO: 1883 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO: 1883 showed the most homology, i.e., 
about 28% identity, with a Drosophila melanogaster locus AE003474 protein (Accession 

# AAF47629). Comparison of SEQ ID NO: 1882 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 1882 showed the most homology, i.e., about 50% 
identity, Penaeus semisulcatus (a crustacean) peritrophin-like protein 1 cDNA (Accession 

# AF095580). Percent identity calculations were performed using GCG version 9.0 using 
default parameters. 
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Example 16 

This Example describes the further characterization and expression of a 
Peritrophin-like sequence cDNA, referred to herein as PL3, isolated by EST sequencing 
described in Example L 
5 A cDNA designated clone 2240-17 was isolated from the unsubtracted HMT 

library as described in Example 1. Analysis of clone 2240-17 indicated that the cDNA, 
denoted nCfPL3 406 , is about 406 nucleotides in length, having a coding strand with 
nucleic acid sequence SEQ ID NO: 1887 and a complementary sequence having SEQ ID 
NO: 1 889. Translation of SEQ ID NO: 1 887 suggests that nucleic acid molecule 

10 nCfPL3 406 encodes a full-length Peritrophin-like protein of 8 1 amino acids, referred to 
herein as PCfPL3 81 , having an amino acid sequence represented by SEQ ID NO: 1888, 
assuming the initiation codon spans from nucleotide 20 through nucleotide 22 of SEQ ID 
NO: 1887 and the termination codon spans from nucleotide 263 through nucleotide 265 of 
SEQ ID NO: 1887. The coding region encoding PCfPL3 81 , is represented by nucleic acid 

15 molecule nCfPL3 2435 having a coding strand with the nucleic acid sequence represented 
by SEQ ID NO: 1890 and a complementary strand with nucleic acid sequence represented 
by SEQ ID NO: 1891. The amino acid sequence of SEQ ID NO: 1 888, predicts that 
PCfPL3 81 has an estimated molecular weight of about 9.1 kDa and an estimated 
isoelectric point (pi) of about 3.64. 

20 Comparison of amino acid sequence SEQ ID NO: 1888 with amino acid sequences 

reported in GenBank indicates that SEQ ID NO: 1888 showed the most homology, i.e., 
about 34.2% identity, with & Anopheles gambiae peritrophin 1 protein (Accession # 
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AAC39 1 27). Comparison of SEQ ID NO: 1 887 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 1887 showed the most homology, i.e., about 37% 
identity, with a Anopheles gambiae chloride intracellular channel 2 (Accession # 
AF03043 1). Percent identity calculations were performed using GCG version 9.0 using 
default parameters. 

In order to express the full-length putative PL3 protein, the entire coding region 
was amplified by PCR and then ligated into the E. coli expression vector pTrcHisB, 
available from Invitrogen, as follows. Forward primer PL3FE, which corresponds to 
nucleotides 70-93 of SEQ ID NO: 1887, having the sequence 5' CGG GAT CCC GAA 
TAT GCT GAC GTA GAT GTG TG 3', denoted SEQ ID NO: 1944, and having a BamM 
restriction endonuclease site indicated in bold, was used in conjunction with reverse 
primer PL3RE, which is complementary to nucleotides 245-269 of SEQ ID NO: 1887, 
having the sequence 5' GGA ATT CTG TTT TAT TCT GGT TGG TAA CAT TC 3', 
denoted herein as SEQ ID NO: 1945 and having an EcoRI restriction endonuclease site 
indicated in bold, in a PCR reaction using SEQ ID NO: 1887 as the template under 
standard PCR reaction conditions and the following thermocy cling conditions: (1) 94° C 
for 30 seconds, (2) 25 cycles of 94°C for 10 seconds, 55°C for 10 seconds and 72°C for 3 
minutes. The reaction product was separated on a 1.5% agarose gel, and a band 
corresponding to an approximately 200 nucleotide molecule, as visualized by agarose gel 
electrophoresis and ethidium bromide staining, was cut from the gel and purified using 
the QIAquick Gel Extraction Kit as described above. 

The product of the PCR reaction was the digested with BamHl and EcoRI 
restriction endonucleases, available from New England BioLabs, Inc. for 18 hours at 
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37 * C, purified using the QIAquick Nucleotide Removal Kit, available from Qiagen, and 
ligated into the vector pTrcHisB which had been similarly digested, treated with shrimp 
alkaline phosphatase, available from New England BioLabs, Inc., for 30 minutes at 37°C, 
and purified. Following standard transformation procedures into £. coZ/BL-21 
competent cells, a bacterial clone containing the plasmid pTrcHisB-PL3 was isolated. 
DNA sequence analysis of the clone confirmed that 70-269 of SEQ ID NO: 1887 had been 
successfully ligated into the pTrcHisB expression vector in frame with the N-terminal T7 
Tag epitope encoded by the vector. The recombinant protein encoded thereby is 
predicted to be 97 amino acids in length and have a molecular mass of 10.9 kDa, 
including the T7 Tag and have a pi of 4.08. 

A recombinant PL3 protein was expressed as follows. Five mis of Luria broth 
were innoculated with a glycerol stock of E. coli BL-21 competent cells, available from 
Novagen, Madison, WI, that had been transformed with the pTrcHisB-PL3 plasmid 
prepared as described above and allowed to grow overnight at 37°C under selection with 
100 |ig/ml ampicillin. A 1 ml aliquot of this culture was then used to inoculate 10 mis of 
fresh Luria broth containing 100 ng/ml ampicillin and the culture was allowed to grow to 
an approximate OD reading of 0.5. A 1 ml aliquot of the culture was removed, the cells 
were pelleted by centrifugation and the supernatant discarded. The cells were 
resuspended in a solution of 100 ^1 PBS and 100 \x\ of 2X SDS-PAGE loading buffer 
(100 mM Tris pH 6.8, 4% SDS, 20% glycerol, 0.02% bromophenol blue, and 10% 2- 
mercaptoethanol). Following removal of the 1 ml aliquot described above, IPTG was 
added to the remaining 9 ml culture to a final concentration of 5 mM of IPTG, the culture 
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was incubated at 37°C for an additional 60 minutes, 1 ml was removed and the OD 
measured at approximately 0.6. The cells in this 1 ml sample were then pelleted by 
centrifugation and resuspended in a solution of 120 ^il of PBS and 120 (il of SDS-PAGE 
loading buffer. Equal volumes of the IPTG-induced and uninduced lysates were loaded 
onto a 14% Tris-Glycine SDS-PAGE gel, available from Novex, San Diego, CA. 
Following electrophoresis, the proteins were transferred from the SDS-PAGE gel to a 
nitrocellulose membrane and a Western blot analysis was performed using the T7 tag 
antibody, available from Novagen, which revealed an approximately 18 kDa protein was 
induced by EPTG. The fact that the recombinant PL3 protein ran at a higher molecular 
weight than predicted is consistent with previous published results for other peritrophin 
proteins, and is thought to be due in part to the characteristically low pi of these proteins 
(Tellam et a/., (1999) Peritrophic Matrix Proteins, Insect Biochemistry and Molecular 
Biology, 29:87-101). Sequence analysis of this protein indicates that it contained the N- 
terminal T7 Tag encoded by the vector. 

Four flasks, each containing 1 liter of Luria broth with 100 |ig/ml ampicillin were 
inoculated with a starter culture of 5 ml of E. coli BL-21 cells transformed with the 
pTrcHisB-PL3 plasmid as described above. The cultures were allowed to grow at 37°C 
until the optical density reached approximately 0.500, at which time a 1 ml aliquot was 
removed from each flask as the pre-induction sample. IPTG was added to each 1 liter 
flask to a final concentration of 0.5 mM and the cultures allowed to grow at 37°C for 135 
additional minutes, at which time a 1 ml aliquot was removed from each flask as the post- 
induction sample. The 1 ml aliquots were centrifuged, the supernatants were discarded 
and the pellets were resuspended in 100|al 2X SDS-PAGE loading buffer per each 0.5 
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optical density units measured. The pre-induction and post induction samples were then 
tested for recombinant PL3 protein expression using the standard Western blot techniques 
and the T7 Tag antibody described above. A protein running at approximately 18 kDa 
was detected in the post-induced but not in the pre-induced samples. 

The cells from the remaining 4 liters of culture were centrifuged, the supernatants 
were discarded and the cell pellets were combined and resuspended in 120 mis of buffer 
A (50 mM Tris, PH 8.0, 20 mM NaCl, 1 mM phenylmethylsulfonyl fluoride (PMSF)). 
The sample was then passed through a microfluidizer five times then rocked at 4°C for 20 
minutes. The sample was then centrifuged for 30 minutes and the supernatant collected. 
Western blot analysis of the supernatant showed that the recombinant PL3 protein was 
soluble in the first buffer A extraction. The buffer A supernatant containing the 
recombinant PL3 protein was then further purified by a nickel column, a Q2 anion 
exchange chromatography column, and cation exchange chromatography, using 
techniques well known to those of skill in the art. 
Example 17 

This Example describes the further characterization of a Peritrophin-like sequence 
cDNA, referred to herein as PL4, isolated by EST sequencing described in Example 1 . 

A cDNA designated clone 2244-71 was isolated from the unsubtracted HMT 
library as described in Example 1. Analysis of clone 2244-71 indicated that the cDNA, 
denoted nCfPL4 974 , is about 974 nucleotides in length, having a coding strand with 
nucleic acid sequence SEQ ID NO: 1892 and a complementary sequence having SEQ ID 
NO: 1893. Translation of SEQ ID NO: 1892 suggests that nucleic acid molecule 
nCfPL4 974 encodes a partial-length Peritrophin-like protein of 285 amino acids. 
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Additional sequence corresponding to the 5' end was isolated by PGR using the RACE 
cDNA pool described in Example 3 as the template, as follows. Adapter Primer 1, i.e. 
SEQ ID NO: 1933, was used as the forward primer in conjunction with reverse primer 
PL4-R1, which is complementary to nucleotides 229-251 of SEQ ID NO: 1892, having a 
nucleic acid sequence 5' GAT ATC CAC TTT GAT CAG CGC AC 3', denoted herein as 
SEQ ID NO: 1946 in a PCR reaction under standard PCR reaction conditions and the 
following thermocycling conditions: (1) 94°C for 30 seconds, (2) 5 cycles of 94°C for 10 
seconds and 72°C for 4 minutes, (3) 5 cycles of 94°C for 10 seconds and 70°C for 4 
minutes, (4) 25 cycles of 94°C for 10 seconds then 68°C for 4 minutes. The products of 
this reaction were diluted 1:50 and used as template in a second PCR reaction using 
Adapter Primer 2, i.e. SEQ ID NO: 1935 as the forward primer and reverse primer PL4- 
R2, which is complementary to nucleotides 58-78 of SEQ ID NO: 1892, having a nucleic 
acid sequence 5' GGT ACT ACT CCT GGT GCG GGC 3', denoted herein as SEQ ID 
NO: 1947, using the thermocycling conditions described for the first PCR reaction. The 
products of this reaction were gel purified as previously described and the fragment was 
ligated into the pCR II TA Cloning vector, available from Qiagen, and sequenced to 
reveal of fragment of approximately 150 nucleotides in length. Sequence analysis 
revealed that nucleotides 68-146 of the fragment had 100% identity with nucleotides 1-79 
of nCfPL4 974 . The two sequences were aligned to form a contiguous sequence of about 
1043 nucleotides in length, referred to as nCfPL4 1043 , having a coding strand with SEQ ID 
NO: 1894 and a complementary strand having SEQ ID NO: 1895. However, the 
contiguous sequence does not appear to encode a starting methionine in the predicted 
protein sequence, thus, a second attempt to isolate the remaining coding sequences at the 
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5' end was performed as follows. A first PCR reaction was performed with Adapter 
Primer 1 as the forward primer and PL4-R2 as the reverse primer using the RACE cDNA 
pool as the template under the thermocycling conditions described above. The products 
of this reaction were diluted 1:50 and used as the template in a second PCR reaction 
which used Adapter Primer 2 as the forward primer and reverse primer PL4-R4, which is 
complementary to nucleotides 58-80 of SEQ ID NO: 1894, having the nucleic acid 
sequence 5' CCG TCG ACA TTA AAC TCA CCA TC 3', denoted SEQ ID NO: 1948, 
under the thermocycling conditions described for the first PCR reaction. The products of 
this reaction were gel purified as previously described and the fragment was ligated into 
the pCR II TA Cloning vector, available from Qiagen, and sequenced to reveal of 
fragment of approximately 100 nucleotides in length. Sequence analysis revealed that 
nucleotides 21-101 of the fragment had 100% identity with nucleotides 1-81 of SEQ ID 
NO: 1892. The two sequences were aligned to form a contiguous sequence that is 1062 
nucleotides in length, referred to herein as nCfPL4 1062 , having a coding strand with SEQ 
ID NO: 1 896 and a complementary strand with SEQ ID NO: 1 898. Translation of SEQ ID 
NO: 1896 suggests that nucleic acid molecule nCfPL4 i062 encodes a full-length 
Peritrophin-like protein of 285 amino acids, referred to herein as PCfPL4 285 , having an 
amino acid sequence represented by SEQ ID NO: 1897, assuming the initiation codon 
spans from nucleotide 19 through nucleotide 21 of SEQ ID NO: 1896 and the termination 
codon spans from nucleotide 874 through nucleotide 876 of SEQ ID NO: 1896. The 
coding region encoding PCfPL4 285 , is represented by nucleic acid molecule nCfPL4 855 , 
having a coding strand with the nucleic acid sequence represented by SEQ ID NO: 1899 
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and a complementary strand with nucleic acid sequence represented by SEQ ID NO: 1900. 
The amino acid sequence of SEQ ID NO: 1 897, predicts that PCfPL4 285 has an estimated 
molecular weight of about 3 1 A kDa and an estimated isoelectric point (pi) of about 6.99. 
Comparison of amino acid sequence SEQ ID NO: 1897 with amino acid sequences 
5 reported in GenBank indicates that SEQ ID NO: 1 897 showed the most homology, i.e., 
about 31.5% identity, with a Drosophila melanogaster Gasp precourser (Accession # 
AAD09748). Comparison of SEQ ID NO: 1896 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 1896 showed the most homology, i.e., about 39.4% 
identity, with a Drosophila melanogaster Gasp precourser (Accession #AF070734 ). 
10 Percent identity calculations were performed using GCG version 9.0 using default 
parameters. 

A Northern Blot analysis was conducted as described in Example 4 to determine 
whether PL4 mRNA is expressed only in certain life stages of the flea life cycle and 
whether PL4 mRNA is expressed only in HMT tissue. Total RNA was extracted from 

15 eggs, first, third, and wandering larvae, pupae, unfed adults, and adults fed on cat blood 
for 0.25, 2, 8, and 24 hours. In addition, total RNA was extracted from hindguts and 
Malpighian tubules extracted from 24 hour cat blood-fed adult fleas, and from the 
remaining body parts following the removal of hindguts and Malpighian tubules. Each 
RNA sample was separated by gel electrophoresis, transferred to nylon membranes and 

20 hybridized with a- 32 P-ATP labeled nCfPL4 974 under the Northern Blotting conditions 
described in Example 4. 
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The results of the Northern blot assay are complex. Although stringent conditions 
were used, several bands with distinct expression patterns were seen. An approximately 
1600 bp message was detected in the egg, first instar, third instar and wandering larval 
stages only. An approximately 1500 bp message was detected in all lifestages and adult 
fed timepoints, but with the strongest signals in the egg, first instar larval, and unfed adult 
stages. A third message, which ran approximately 1200 bp, was detected in the egg, first 
instar larval, pupal, and adult lifestages, including all unfed and fed adult timepoints. All 
three of the messages detected were seen only in the HMT tissues, and were not detected 
in the carcass tissues. 

The detection of three mRNAs instead of one may be the result of the expression 
of three highly homologous transcripts. It has been reported in the literature that 
peritrophin gene families have been found that consist of a number of highly related 
genes (See Schorderet et al, 1998, cDNA and deduced amino acid sequences of a 
peritrophic membrane glycoprotein, 'peritrophin-48', from the larvae of Lucilia cuprina x 
Insect Biochemistry and Molecular Biology 28, 99-1 1 1). It is possible that these 
transcripts represent the products of such a family or that the messages are the RNA 
products of alternative splicing of a single gene locus. 
Example 18 

This Example describes the further characterization of a synaptic vesicle 2B-like 
sequence cDNA, isolated by EST sequencing described in Example 1. 

A cDNA designated clone 2104-59 was isolated from the subtracted HMT library 
as described in Example 1, denoted herein as SEQ ID NO:358. DNA from clone 2104-59 
was purified, and the insert used for plaque hybridization screening of the unsubtracted 
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HMT cDNA library as follows. The insert from clone 2104-59 was excised by digestion 
with EcoKL, separated by agarose gel electrophoresis and purified using the QiaQuick Gel 
Extraction kit, available from Qiagen. A Megaprime DNA labeling kit, available from 
Amersham Pharmacia, was used to incorporate a- 32 P-labeled dATP into the random- 

5 primed probe mix. Hybridization and plaque purification were performed as previously 
described which resulted in the isolation of a clone containing an about 1875 nucleotide 
synaptic vesicle 2B-like sequence, referred to herein as nCfSVPi 875 , having a coding 
strand with nucleic acid sequence SEQ ID NO: 1901 and a complementary sequence 
having SEQ ID NO: 1903. Translation of SEQ ID NO: 1901 suggests that nucleic acid 

10 molecule nCfSVP 1875 encodes a full-length synaptic vesicle 2B-like protein of 530 amino 
acids, referred to herein as PCfSVP 530 , having an amino acid sequence represented by 
SEQ ID NO: 1902, assuming the initiation codon spans from nucleotide 44 through 
nucleotide 46 of SEQ ID NO:1901 and the termination codon spans from nucleotide 1634 
through nucleotide 1636 of SEQ ID NO: 1901. The coding region encoding PCfSVP 530 , is 

1 5 represented by nucleic acid molecule nCfSVP 1590 , having a coding strand with the nucleic 
acid sequence represented by SEQ ID NO: 1904 and a complementary strand with nucleic 
acid sequence represented by SEQ ID NO: 1905. The amino acid sequence of SEQ ED 
NO: 1902, predicts that PCfSVP 530 has an estimated molecular weight of about 58.7 kDa 
and an pi of about 7.61. 

20 Comparison of amino acid sequence SEQ ID NO: 1902 with amino acid sequences 

reported in GenBank indicates that SEQ ID NO: 1902 showed the most homology, i.e., 
about 32% identity, with a Drosophila melanogaster B ACR7A4.y (Accession # 
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CAB51685). Comparison of SEQ ID NO:1901 with nucleic acid sequences reported in 
GenBank indicates that SEQ ID NO: 1901 showed the most homology, i.e., about 39% 
identity, with a Rattus norvegicus synaptic vesicle protein 2B (SVP2B) mRNA 
(Accession # L 10362). Percent identity calculations were performed using GCG version 
9.0 using default parameters. 
Example 19 

This Example describes the further characterization of a Voltage-Gated Chloride 
Channel-like sequence cDNA, isolated by EST sequencing described in Example 1. 

A cDNA designated clone 2108-09 was isolated from the unsubtracted HMT 
library as described in Example 1. Analysis of clone 2108-09 indicated that the cDNA, 
denoted nCfVGCC 381 , is about 381 nucleotides in length, having a coding strand with 
nucleic acid sequence SEQ ID NO: 1906 and a complementary sequence having SEQ ID 
NO:1907. Translation of SEQ ID NO:1906 suggests that nucleic acid molecule 
nCfVGCC 381 encodes a partial-length Voltage-Gated Chloride Channel-like protein of 
126 amino acids. Additional sequence corresponding to the 5' end was isolated by 
hybridization and PCR as follows. 

The insert from clone 2108-09 was excised by digestion with £coRI, separated by 
agarose gel electrophoresis and purified using the QiaQuick Gel Extraction kit, available 
from Qiagen. A Megaprime DNA labeling kit, available from Amersham Pharmacia, 
was used to incorporate a- 32 P-labeled dATP into the random-primed probe mix. 

Hybridization and plaque purification were performed on the unsubtracted HMT cDNA 
library as previously described which resulted in the isolation of a clone containing an 
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about 2191 nucleotide VGCC-like sequence, referred to herein as nCfVGCC 2191 , having a 
coding strand with nucleic acid sequence SEQ ID NO: 1908 and a complementary 
sequence having SEQ ID NO: 1909. Translation of SEQ ID NO: 1908 suggests that 
nucleic acid molecule nCfVGCC 2I91 encodes a partial s VGCC-like protein of 595 amino 
5 acids. 

In order to isolate the remaining coding regions at the 5' end, a PCR was 
performed using the RACE cDNA pool, prepared as described in Example 3, as the 

3 template as follows. Adapter Primer 1 was used as the forward primer in conjunction 

y3 

J3 with reverse primer VGCC-R1, which is complementary to the nucleotides 1482-1503 of 

M, 

jfl 10 SEQ ID NO: 1908, having a nucleic acid sequence 5' CGA TCA TGC GTC TAG CAT 

01 TGG C 3', denoted herein as SEQ ID NO: 1949 under standard PCR reaction conditions 

Q and the following thermocycling conditions: (1) 94°C for 30 seconds, (2) 5 cycles of 

[I 94°C for 10 seconds and 72°C for 4 minutes, (3) 5 cycles of 94°C for 10 seconds and 

U 70°C for 4 minutes, (4) 25 cycles of 94°C for 10 seconds and 68°C for 4 minutes. The 

15 reaction products were separated on an agarose gel and a band corresponding to an 

approximately 1970 nucleotide molecule was isolated, purified using a Gel Purification 
Kit, available from Qiagen, ligated into the pCR II TA cloning vector, available from 
Invitrogen, and sequenced using an ABI PRISM 377 automatic DNA Sequencer. 
Sequence analysis revealed an approximately 1968 nucleotide fragment, referred to as 
20 nCfV GCC l968 , having a coding strand with SEQ ID NO: 1910 and a complementary 
strand with SEQ ID NO: 1911. Sequence analysis also revealed that nucleic acid 
molecule nCfVGCC 196g does not encode a start codon, thus, a second 5' RACE PCR was 
performed as follows in order to isolate additional sequence. Adapter Primer 1 was used 
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as the forward primer in conjunction with reverse primer VGCC-R4 primer which is 
complementary to nucleotides 350-372 of SEQ ID NO: 1910, having a nucleic acid 
sequence 5' CCC GCC CCA GTT CTA GGT TGT CC 3', denoted herein as SEQ ID 
NO: 1950, using the RACE cDNA pool prepared as described in Example 3 as the 
5 template, and the PCR reaction and thermocycling conditions as described for the first 
PCR reaction. The products of this reaction were then diluted 1:50 in water and used as 
the template in a second PCR reaction with Adapter Primer 2 as the forward primer in 
conjunction with reverse primer VGCC-R2, which is complementary to nucleotides 134- 
J 153 of SEQ ID NO:1910, having a nucleic acid sequence 5* CAC ACC CAA CCT GAC 

£ 10 CAG GC 3', denoted herein as SEQ ID NO:1951, under the PCR reaction and 

- 1 thermocycling conditions as described for the first PCR reaction. 

fT The products of this reaction were gel purified as previously described and the 

j T fragment was ligated into the pCR II TA Cloning vector, available from Qiagen, and 

N 8 sequenced to reveal of fragment of approximately 673 nucleotides in length, referred to 

1 5 herein as nCfVGCC 673 , having a coding strand with SEQ ID NO: 19 12 and a 
complementary strand with SEQ ID NO: 1913. Sequence analysis revealed that 
nucleotides 520-673 of the fragment had 100% identity with nucleotides 1-154 of SEQ 
ID NO: 1910. The VGCC fragments were aligned to form a contiguous sequence that is 
3126 nucleotides in length, referred to herein as nCfVGCC 3126J having a coding strand 
20 with SEQ ID NO:1914 and a complementary strand with SEQ ED NO:1916. Translation 
of SEQ ID NO: 1914 suggests that nucleic acid molecule nCfVGCC 3126 encodes a full- 
length VGCC-like protein of 851 amino acids, referred to herein as PCfVGCC 85l5 having 
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an amino acid sequence represented by SEQ ID NO: 1915, assuming the initiation codon 
spans from nucleotide 168 through nucleotide 170 of SEQ ID NO: 19 14 and the 
termination codon spans from nucleotide 2721 through nucleotide 2723 of SEQ ID 
NO: 1914. The coding region encoding PCfVGCC g51 is represented by nucleic acid 
5 molecule nCfVGCC 2553 , having a coding strand with the nucleic acid sequence 

represented by SEQ ID NO: 1917 and a complementary strand with nucleic acid sequence 
represented by SEQ ID NO:1918. The amino acid sequence of SEQ ID NO:1915, 

J3 predicts that PCfVGCC 851 has an estimated molecular weight of about 93.4 kDa and an 

r! estimated pi of about 7.35. 

i7\ 1 0 Comparison of amino acid sequence SEQ ID NO: 1915 with amino acid sequences 

5 reported in GenBank indicates that SEQ ID NO: 1915 showed the most homology, i.e., 

H about 63.1% identity, with a Oryctolagus cuniculus (rabbit) chloride channel protein 3 

(CLCN3) (Accession #AAB95 163). Comparison of SEQ ID NO: 1914 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO: 1914 showed the most 
15 homology, i.e., about 61.3% identity, with a Oryctolagus cuniculus chloride channel 

protein 3 (CLCN3) mRNA (Accession # AF029348). Percent identity calculations were 
performed using GCG version 9.0 using default parameters. 

A Northern Blot analysis was conducted as described in Example 4 to determine 
whether VGCC mRNA is expressed only in certain life stages of the flea life cycle and 
20 whether VGCC mRNA is expressed only in HMT tissue. Total RNA was extracted from 
eggs, first, third, and wandering larvae, pupae, unfed adults, and adults fed on cat blood 
for 0.25, 2, 8, and 24 hours. In addition, total RNA was extracted from hindguts and 
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Malpighian tubules extracted from 24 hour cat blood-fed adult fleas, and from the 
remaining body parts following the removal of hindguts and Malpighian tubules. Each 
RNA sample was separated by gel electrophoresis, transferred to nylon membranes and 
hybridized with oc- 32 P-ATP labeled nCfVGCC 381 under the Northern Blotting conditions 
5 described in Example 4. An approximately 3 kB band was detected in all lifestages and 
adult unfed and fed timepoints, however, the intensity of the signal did vary between 
stages with the strongest signals seen in the egg, unfed adult, and 0.25 hour fed adult 
stages, and the weakest signals seen in the 3rd instar larval and pupal stages. A strong 
signal was detectable in the 24 hour fed adult HMT tissues, but only a very weak signal 
1 0 was present in the carcass tissues. 
Example 20 



This Example describes the further characterization and expression of an 
Intersectin-like cDNA isolated by EST sequencing described in Example 2. 

A cDNA designated clone 2225-23 was isolated from the unsubtracted HNC 

15 library as described in Example 2, denoted herein as SEQ ID NO: 121 . A Northern Blot 
analysis was conducted as described in Example 10 to determine whether clone 2225-23 
mRNA is expressed exclusively in HNC tissues. For the hybridization step, a probe 
comprising the flea clone 2225-23 nucleic acid molecule was labeled with a- 32 P-ATP 
using a DNA labeling kit, available from Amersham and added to the buffer at a 

20 ^ concentration of approximately 1 x 10 6 cpm/rnl, and allowed to hybridize for about 14 to 
18 hours at 42°C. The blot was then washed twice for 10 minutes per wash in 0.5X SSPE 
and 0. 1% sarcosyi at 55°C and exposed to film for autoradiography. Analysis of the 



-221- 



FC-6-C1 

developed film showed that there was greater expression of clone 2225-23 mRNA in 
HNC tissues compared to non-HNC tissues, indicating possible upregulation of clone 
2225-23 in flea head and nerve cords. 
Example 21 

5 This Example describes the further characterization and expression of an 

Neuroendocrine Specific Protein C-like cDNA isolated by EST sequencing described in 
Example 2. 

A cDNA designated clone 2249-19 was isolated from the unsubtracted HNC 
library as described in Example 2, denoted herein as SEQ ID NO: 1775. A Northern Blot 

10 analysis was conducted as described in Example 10 to determine whether clone 2249-19 
mRNA is expressed exclusively in HNC tissues. For the hybridization step, a probe 
having the nucleic acid sequence of clone 2249-19 was generated as follows. A PGR 
reaction was conducted using forward primer 2249-19for, having a nucleotide sequence 
5' AGT CGC ATA GTG CAC TTC TGA ATG 3', denoted herein as SEQ ID NO: 1954, 

15 and reverse primer 2249-19rev, having a nucleotide sequence 5' CTG ACA TCT GTT 
TCC ACA GCT C 3', denoted herein as SEQ ID NO:1955, using the HNC cDNA library 
prepared as described in Example 2 as the template under standard PCR reaction 
conditions and the following thermocycling conditions: (1) one minute at 95°C, (2) two 
cycles of 94°C for 10 seconds, 50°C for 20 seconds, and 72°C for 20 seconds, (3) thirty 

20 cycles of 94°C for 10 sec, 53°C for 20 sec, 72°C for 40 sec. The PCR product was ligated 
into the TA vector using a TA cloning kit, available from Invitrogen and the clone was 
digested with EcoRl enzyme, and purified from an agarose gel. The purified nucleic acid 
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molecule was labeled with cc- 32 P-ATP using a DNA labeling kit, available from 
Amersham and added to the buffer at a concentration of approximately 1 x 10 6 cpm/ml, 
and allowed to hybridize for about 14 to 18 hours at 42°C. The blot was then washed 
twice for 10 minutes per wash in 0.5X SSPE and 0.1% sarcosyl at 55°C and exposed to 
5 film for autoradiography. Analysis of the developed film showed that there was 

expression of clone 2249-19 mRNA in HNC tissues and non-HNC tissues with 2 bands 
evident; one at approximately 1.5 Kb and one at approximately 2.5 Kb. 
Example 22 

This Example describes the further characterization and expression of an anoxia 
10 upregulated protein-like cDNA isolated by EST sequencing described in Example 2. 

A TA clone from the HNC EST library described in Example 2 designated clone 
2218-95, denoted herein as SEQ ID NO: 1858 was sequenced using standard sequencing 
methods and shown to contain a non-full length nucleic acid molecule having significant 
homology to anoxia upregulated protein (AUP) genes. Additional sequence encoding an 
15 AUP gene was isolated as follows. A hybridization probe containing the nucleic acid 
sequence of SEQ ID NO: 1 858 was constructed as follows. A PCR reaction was 
conducted using forward primer 2218-95for, having a nucleotide sequence 5' AAT AGT 
GAT GTT GTA AGA GTT AGG 3', denoted herein as SEQ ID NO: 1956, and reverse 
primer 2218-95rev, having a nucleotide sequence 5' GTT TAA TAT TGC ATG TTT 
20 ATT CAT TAA AA 3' , denoted herein as SEQ ID NO: 1957, using the HNC cDNA 

library prepared as described in Example 2 as the template under standard PCR reaction 
conditions and the following thermocycling conditions: (1) one minute at 95°C, (2) thirty 
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cycles of 94°C for 10 sec, 55°C for 20 sec, 72°C for 20 sec. The PCR product was ligated 
into the TA vector using a TA cloning kit, available from Invitrogen and the clone was 
digested with EcoRl enzyme, and purified from an agarose gel. The purified nucleic acid 
molecule was labeled with a- 32 P-ATP using a DNA labeling kit, available from 
Amersham. 

The 32 P cc-dATP labeled probe was used in a standard plaque lift hybridization 
procedure to isolate a clone from the HNC lambda-ZAP unsubtracted cDNA library 
described in Example 2. Hybridization was conducted as described in Example 12 and a 
plaque that hybridized strongly to the probe was isolated, purified and sequenced as 
described in Example 12. Sequencing revealed that the clone contained a nucleic acid 
molecule of about 1181 nucleotides, referred to herein as nCfAUP U81 , having a 
nucleotide sequence denoted herein as SEQ ID NO: 1919. The complement of SEQ ID 
NO:1919 is represented herein as SEQ ID NO:1921. 

Translation of SEQ ID NO: 1919 suggests that nucleic acid molecule nC£AUP U81 
encodes a full-length AUP protein of 102 amino acids, referred to herein as PCfAUP 102 , 
having an amino acid sequence represented by SEQ ID NO: 1920, assuming the initiation 
codon spans from nucleotide 127 through nucleotide 129 of SEQ ID NO: 1919 and the 
termination codon spans from nucleotide 433 through nucleotide 435 of SEQ ID 
NO: 1919. The coding region encoding PCfAUP 102 , is represented by nucleic acid 
molecule nCfAUP 306 , having a coding strand with the nucleic acid sequence represented 
by SEQ ID NO: 1922 and a complementary strand with nucleic acid sequence represented 
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by SEQ ID NO: 1923. The amino acid sequence of PCfAUP 102 , predicts that PCfAUP 102 
has an estimated molecular weight of about 1 1.9 kDa and an estimated pi of about 10.5. 

Comparison of amino acid sequence SEQ ID NO: 1920 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO: 1920 showed the most homology, i.e., 
5 about 52% identity, with a Drosophila melanogaster anoxia upregulated protein, 

GenBank Accession No. AAD38397. Percent identity calculations were performed using 
GCG version 9.0 using default parameters. Blast comparison of nucleic acid sequence 
SEQ ID NO: 1919 with nucleic acid sequences reported in GenBank indicates that SEQ 
ID NO: 19 19 showed the most homology to a clone from human chromosome 14q3 1 

10 region containing gene for neurexin IE, GenBank #AC007056. Pairwise identity could 
not be performed as the human clone in GenBank is too large to load into GCG version 

" " 9.0. \ " " " ~ " ■ 
Example 23 

This Example describes the further characterization of a neuroendocrine specific 
15 7B2 polypeptide, isolated by EST sequencing described in Example 2. 

A cDNA designated clone 221 1-21 was isolated from the subtracted HNC library 
as described in Example 2, denoted herein as SEQ ID NO:92. DNA from clone 221 1-21 
was purified, and the insert used for plaque hybridization screening of the unsubtracted 
HMT cDNA library as follows. The insert from clone 221 1-21 was excised by digestion 
20 with EcoRI, separated by agarose gel electrophoresis and purified using the QiaQuick Gel 
Extraction kit, available from Qiagen. A Megaprime DNA labeling kit, available from 
Amersham Pharmacia, was used to incorporate a- 32 P-labeled dATP into the random- 
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primed probe mix. The 32 P a-dATP labeled probe was used in a standard plaque lift 
hybridization procedure to isolate a clone from the HNC lambda- ZAP unsubtracted 
cDNA library, prepared as described in Example 2. The following hybridization 
conditions were used. Hybond-N filters, available from Amersham, were hybridized with 
about 2 X 10 6 counts per minute (cpm) per ml of the probe in 50 ml of hybridization 
solution (5X SSPE, 25mM EDTA pH 8.0, 5X Denhardt's reagent, 1.2% SDS, 0.020 
mg/mL salmon sperm DNA) at 55°C for about 48 hours. The filters were washed once in 
50mL 4X SSPE, 1% SDS for 15 minutes at 55°C, once in 50 mL 2X SSPE, 1% SDS for 
10 minutes at 55°C, and washed twice in 50 mL 0.5X SSPE, 0.5% SDS for 10 minutes at 
55°C. The filters were then subjected to autoradiography. One plaque that hybridized 
strongly to the probe was isolated and subjected to in vivo excision using the Stratagene 
Ex-Assist™ helper phage system and protocols. Miniprep DNA was prepared from the 
positive clone using a Miniprep kit and protocol, available from Qiagen, Chatsworth, CA, 
and sequenced using standard sequencing procedures. The clone, referred to as 
nCf7B2 2161 contains a nucleic acid molecule of about 2161 nucleotides in length, having a 
coding strand with nucleic acid sequence SEQ ID NO: 1924 and a complementary 
sequence having SEQ ID NO: 1926. 

Translation of SEQ ID NO: 1924 suggests that nucleic acid molecule nC£7B2 2161 
encodes a full-length 7B2-like protein of 267 amino acids, referred to herein as 
PCf7B2 267 , having an amino acid sequence represented by SEQ ID NO: 1925, assuming 
the initiation codon spans from nucleotide 107 through nucleotide 109 of SEQ ID 
NO: 1924 and the termination codon spans from nucleotide 908 through nucleotide 910 of 
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SEQ ID NO: 1924. The coding region encoding PCf7B2 267 , is represented by nucleic acid 
molecule nCf7B2 801 , having a coding strand with the nucleic acid sequence represented 
by SEQ ID NO: 1927 and a complementary strand with nucleic acid sequence represented 
by SEQ ED NO: 1928. The amino acid sequence of SEQ ID NO: 1925, predicts that 

5 PCf7B2 267 has an estimated molecular weight of about 3 1 kDa and an estimated pi of 
about 5. Analysis of PCf7B2 267 suggests the presence of a signal peptide encoded by a 
stretch of amino acids spanning from about amino acid 1 through amino acid 20. The 
proposed mature protein, referred to herein as PCf7B2 247 contains 247 amino acids, 
designated SEQ ID NO: 1930, and is encoded by a nucleic acid molecule referred to as 

10 nCf7B2 741 , having a coding strand with SEQ ID NO: 1929 and a complementary strand 
with SEQ ID NO: 1931. 

Comparison of amino acid sequence SEQ ID NO: 1925 with amino acid sequences 
reported in GenBank indicates that SEQ ID NO: 1925 showed the most homology, i.e., 
about 39% identity, with a Drosophila melanogaster protein, GenBank Accession No. 

15 AAF52036. Percent identity calculations were performed using GCG version 9.0 using 
default parameters. Blast comparison of nucleic acid sequence SEQ ID NO: 1924 with 
nucleic acid sequences reported in GenBank indicates that SEQ ID NO: 1924 showed the 
most homology to a human chromosome 19, cosmid R28204 clone, GenBank #Accession 
No. AC006132. Pairwise identity could not be performed as the human clone in 

20 GenBank is too large to load into GCG version 9.0, however, the BLAST score was 0.20, 
which is not considered to be significant level of identity. 
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A Northern Blot analysis was conducted as described in Example 10 to determine 
whether 7B2 mRNA is expressed exclusively in HNC tissues. For the hybridization step, 
a probe having the nucleic acid sequence of clone 221 1-21 was generated as follows. A 
PCR reaction was conducted using forward primer 221 l-21for, having a nucleotide 

5 sequence 5' GCG CCA TGA AGA TTT CAG GCG 3', denoted herein as SEQ ID 
NO: 1958, and reverse primer 221 l-21rev, having a nucleotide sequence 5' AAG TGC 
AAT GAA TCA TCA GCA AG 3\ denoted herein as SEQ ID NO: 1959, using the HNC 
cDNA library prepared as described in Example 2 as the template under standard PCR 
reaction conditions and the following thermocycling conditions: (1) one minute at 95°C, 

10 (2) five cycles of 94°C for 10 seconds, 50°C for 20 seconds, and 72°C for 20 seconds, (3) 
thirty cycles of 94°C for 10 sec, 53°C for 20 sec, 72°C for 40 sec. The PCR product was 
ligated into the TA vector using a TA cloning kit, available from Invitrogen and the clone 
was digested with EcoRl enzyme, and purified from an agarose gel. The purified nucleic 
acid molecule was labeled with a- 32 P-ATP using a DNA labeling kit, available from 

15 Amersham and added to the buffer at a concentration of approximately 1 x 10 6 cpm/ml, 
and allowed to hybridize for about 14 to 18 hours at 42°C. The blot was then washed 
twice for 10 minutes per wash in 0.5X SSPE and 0. 1% sarcosyl at 55°C and exposed to 
film for autoradiography. Analysis of the developed film showed that after 2.5 days of 
exposure clone 221 1-21 mRNA was expressed exclusively in HNC tissue. 

20 While various embodiments of the present invention have been described in 

detail, it is apparent that modifications and adaptations of those embodiments will occur 
to those skilled in the art. It is to be expressly understood, however, that such 
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modifications and adaptations are within the scope of the present invention, as set forth in 
the following claims: 
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